hello
hello

📌S Retain class distribution for seed 10:
Class 0: 4500
Class 1: 4500
Class 2: 4500
Class 3: 4500
Class 4: 4500
Class 5: 4500
Class 6: 4500
Class 7: 4500
Class 8: 4500
Class 9: 4500

📌S Forget class distribution for seed 10:
Class 0: 500
Class 1: 500
Class 2: 500
Class 3: 500
Class 4: 500
Class 5: 500
Class 6: 500
Class 7: 500
Class 8: 500
Class 9: 500
72

📊 Updated class distribution:
Retain set:
  Class 0: 4875
  Class 1: 4875
  Class 2: 4875
  Class 3: 4875
  Class 4: 4875
  Class 5: 4875
  Class 6: 4875
  Class 7: 4875
  Class 8: 4875
  Class 9: 4875
Forget set:
  Class 0: 125
  Class 1: 125
  Class 2: 125
  Class 3: 125
  Class 4: 125
  Class 5: 125
  Class 6: 125
  Class 7: 125
  Class 8: 125
  Class 9: 125
hello
hello
⚠️ Warning: Retain train loader may not be shuffled.
Training Epoch: 1 [256/48750]	Loss: 2.4208	LR: 0.000000
Training Epoch: 1 [512/48750]	Loss: 2.4631	LR: 0.000524
Training Epoch: 1 [768/48750]	Loss: 2.4837	LR: 0.001047
Training Epoch: 1 [1024/48750]	Loss: 2.4167	LR: 0.001571
Training Epoch: 1 [1280/48750]	Loss: 2.3532	LR: 0.002094
Training Epoch: 1 [1536/48750]	Loss: 2.2551	LR: 0.002618
Training Epoch: 1 [1792/48750]	Loss: 2.1172	LR: 0.003141
Training Epoch: 1 [2048/48750]	Loss: 1.9557	LR: 0.003665
Training Epoch: 1 [2304/48750]	Loss: 1.6764	LR: 0.004188
Training Epoch: 1 [2560/48750]	Loss: 1.5010	LR: 0.004712
Training Epoch: 1 [2816/48750]	Loss: 1.2860	LR: 0.005236
Training Epoch: 1 [3072/48750]	Loss: 0.9972	LR: 0.005759
Training Epoch: 1 [3328/48750]	Loss: 0.8123	LR: 0.006283
Training Epoch: 1 [3584/48750]	Loss: 0.7514	LR: 0.006806
Training Epoch: 1 [3840/48750]	Loss: 0.6055	LR: 0.007330
Training Epoch: 1 [4096/48750]	Loss: 0.4422	LR: 0.007853
Training Epoch: 1 [4352/48750]	Loss: 0.3234	LR: 0.008377
Training Epoch: 1 [4608/48750]	Loss: 0.3228	LR: 0.008901
Training Epoch: 1 [4864/48750]	Loss: 0.2842	LR: 0.009424
Training Epoch: 1 [5120/48750]	Loss: 0.2933	LR: 0.009948
Training Epoch: 1 [5376/48750]	Loss: 0.2469	LR: 0.010471
Training Epoch: 1 [5632/48750]	Loss: 0.2344	LR: 0.010995
Training Epoch: 1 [5888/48750]	Loss: 0.2236	LR: 0.011518
Training Epoch: 1 [6144/48750]	Loss: 0.2288	LR: 0.012042
Training Epoch: 1 [6400/48750]	Loss: 0.1021	LR: 0.012565
Training Epoch: 1 [6656/48750]	Loss: 0.2315	LR: 0.013089
Training Epoch: 1 [6912/48750]	Loss: 0.1681	LR: 0.013613
Training Epoch: 1 [7168/48750]	Loss: 0.1775	LR: 0.014136
Training Epoch: 1 [7424/48750]	Loss: 0.1587	LR: 0.014660
Training Epoch: 1 [7680/48750]	Loss: 0.1213	LR: 0.015183
Training Epoch: 1 [7936/48750]	Loss: 0.2066	LR: 0.015707
Training Epoch: 1 [8192/48750]	Loss: 0.1213	LR: 0.016230
Training Epoch: 1 [8448/48750]	Loss: 0.2015	LR: 0.016754
Training Epoch: 1 [8704/48750]	Loss: 0.1547	LR: 0.017277
Training Epoch: 1 [8960/48750]	Loss: 0.2462	LR: 0.017801
Training Epoch: 1 [9216/48750]	Loss: 0.2325	LR: 0.018325
Training Epoch: 1 [9472/48750]	Loss: 0.2239	LR: 0.018848
Training Epoch: 1 [9728/48750]	Loss: 0.2012	LR: 0.019372
Training Epoch: 1 [9984/48750]	Loss: 0.2168	LR: 0.019895
Training Epoch: 1 [10240/48750]	Loss: 0.2882	LR: 0.020419
Training Epoch: 1 [10496/48750]	Loss: 0.1909	LR: 0.020942
Training Epoch: 1 [10752/48750]	Loss: 0.3044	LR: 0.021466
Training Epoch: 1 [11008/48750]	Loss: 0.3162	LR: 0.021990
Training Epoch: 1 [11264/48750]	Loss: 0.3129	LR: 0.022513
Training Epoch: 1 [11520/48750]	Loss: 0.1465	LR: 0.023037
Training Epoch: 1 [11776/48750]	Loss: 0.1826	LR: 0.023560
Training Epoch: 1 [12032/48750]	Loss: 0.1055	LR: 0.024084
Training Epoch: 1 [12288/48750]	Loss: 0.2351	LR: 0.024607
Training Epoch: 1 [12544/48750]	Loss: 0.1963	LR: 0.025131
Training Epoch: 1 [12800/48750]	Loss: 0.2301	LR: 0.025654
Training Epoch: 1 [13056/48750]	Loss: 0.1874	LR: 0.026178
Training Epoch: 1 [13312/48750]	Loss: 0.2355	LR: 0.026702
Training Epoch: 1 [13568/48750]	Loss: 0.2885	LR: 0.027225
Training Epoch: 1 [13824/48750]	Loss: 0.2297	LR: 0.027749
Training Epoch: 1 [14080/48750]	Loss: 0.2008	LR: 0.028272
Training Epoch: 1 [14336/48750]	Loss: 0.2927	LR: 0.028796
Training Epoch: 1 [14592/48750]	Loss: 0.3263	LR: 0.029319
Training Epoch: 1 [14848/48750]	Loss: 0.1703	LR: 0.029843
Training Epoch: 1 [15104/48750]	Loss: 0.1767	LR: 0.030366
Training Epoch: 1 [15360/48750]	Loss: 0.2450	LR: 0.030890
Training Epoch: 1 [15616/48750]	Loss: 0.3389	LR: 0.031414
Training Epoch: 1 [15872/48750]	Loss: 0.1490	LR: 0.031937
Training Epoch: 1 [16128/48750]	Loss: 0.1735	LR: 0.032461
Training Epoch: 1 [16384/48750]	Loss: 0.2760	LR: 0.032984
Training Epoch: 1 [16640/48750]	Loss: 0.1706	LR: 0.033508
Training Epoch: 1 [16896/48750]	Loss: 0.2440	LR: 0.034031
Training Epoch: 1 [17152/48750]	Loss: 0.2966	LR: 0.034555
Training Epoch: 1 [17408/48750]	Loss: 0.1991	LR: 0.035079
Training Epoch: 1 [17664/48750]	Loss: 0.2628	LR: 0.035602
Training Epoch: 1 [17920/48750]	Loss: 0.1271	LR: 0.036126
Training Epoch: 1 [18176/48750]	Loss: 0.2029	LR: 0.036649
Training Epoch: 1 [18432/48750]	Loss: 0.1551	LR: 0.037173
Training Epoch: 1 [18688/48750]	Loss: 0.1951	LR: 0.037696
Training Epoch: 1 [18944/48750]	Loss: 0.1925	LR: 0.038220
Training Epoch: 1 [19200/48750]	Loss: 0.1665	LR: 0.038743
Training Epoch: 1 [19456/48750]	Loss: 0.2031	LR: 0.039267
Training Epoch: 1 [19712/48750]	Loss: 0.3016	LR: 0.039791
Training Epoch: 1 [19968/48750]	Loss: 0.1899	LR: 0.040314
Training Epoch: 1 [20224/48750]	Loss: 0.2162	LR: 0.040838
Training Epoch: 1 [20480/48750]	Loss: 0.1687	LR: 0.041361
Training Epoch: 1 [20736/48750]	Loss: 0.3084	LR: 0.041885
Training Epoch: 1 [20992/48750]	Loss: 0.1832	LR: 0.042408
Training Epoch: 1 [21248/48750]	Loss: 0.2206	LR: 0.042932
Training Epoch: 1 [21504/48750]	Loss: 0.2027	LR: 0.043455
Training Epoch: 1 [21760/48750]	Loss: 0.1306	LR: 0.043979
Training Epoch: 1 [22016/48750]	Loss: 0.2392	LR: 0.044503
Training Epoch: 1 [22272/48750]	Loss: 0.2517	LR: 0.045026
Training Epoch: 1 [22528/48750]	Loss: 0.1698	LR: 0.045550
Training Epoch: 1 [22784/48750]	Loss: 0.2321	LR: 0.046073
Training Epoch: 1 [23040/48750]	Loss: 0.2629	LR: 0.046597
Training Epoch: 1 [23296/48750]	Loss: 0.1176	LR: 0.047120
Training Epoch: 1 [23552/48750]	Loss: 0.2197	LR: 0.047644
Training Epoch: 1 [23808/48750]	Loss: 0.1732	LR: 0.048168
Training Epoch: 1 [24064/48750]	Loss: 0.1076	LR: 0.048691
Training Epoch: 1 [24320/48750]	Loss: 0.1494	LR: 0.049215
Training Epoch: 1 [24576/48750]	Loss: 0.2070	LR: 0.049738
Training Epoch: 1 [24832/48750]	Loss: 0.1793	LR: 0.050262
Training Epoch: 1 [25088/48750]	Loss: 0.1959	LR: 0.050785
Training Epoch: 1 [25344/48750]	Loss: 0.1588	LR: 0.051309
Training Epoch: 1 [25600/48750]	Loss: 0.3146	LR: 0.051832
Training Epoch: 1 [25856/48750]	Loss: 0.3353	LR: 0.052356
Training Epoch: 1 [26112/48750]	Loss: 0.2363	LR: 0.052880
Training Epoch: 1 [26368/48750]	Loss: 0.2167	LR: 0.053403
Training Epoch: 1 [26624/48750]	Loss: 0.2293	LR: 0.053927
Training Epoch: 1 [26880/48750]	Loss: 0.1785	LR: 0.054450
Training Epoch: 1 [27136/48750]	Loss: 0.2570	LR: 0.054974
Training Epoch: 1 [27392/48750]	Loss: 0.2534	LR: 0.055497
Training Epoch: 1 [27648/48750]	Loss: 0.1476	LR: 0.056021
Training Epoch: 1 [27904/48750]	Loss: 0.2679	LR: 0.056545
Training Epoch: 1 [28160/48750]	Loss: 0.2207	LR: 0.057068
Training Epoch: 1 [28416/48750]	Loss: 0.1991	LR: 0.057592
Training Epoch: 1 [28672/48750]	Loss: 0.1707	LR: 0.058115
Training Epoch: 1 [28928/48750]	Loss: 0.2827	LR: 0.058639
Training Epoch: 1 [29184/48750]	Loss: 0.2949	LR: 0.059162
Training Epoch: 1 [29440/48750]	Loss: 0.2076	LR: 0.059686
Training Epoch: 1 [29696/48750]	Loss: 0.1168	LR: 0.060209
Training Epoch: 1 [29952/48750]	Loss: 0.1844	LR: 0.060733
Training Epoch: 1 [30208/48750]	Loss: 0.1724	LR: 0.061257
Training Epoch: 1 [30464/48750]	Loss: 0.2902	LR: 0.061780
Training Epoch: 1 [30720/48750]	Loss: 0.1972	LR: 0.062304
Training Epoch: 1 [30976/48750]	Loss: 0.1846	LR: 0.062827
Training Epoch: 1 [31232/48750]	Loss: 0.2310	LR: 0.063351
Training Epoch: 1 [31488/48750]	Loss: 0.4859	LR: 0.063874
Training Epoch: 1 [31744/48750]	Loss: 0.3662	LR: 0.064398
Training Epoch: 1 [32000/48750]	Loss: 0.2794	LR: 0.064921
Training Epoch: 1 [32256/48750]	Loss: 0.2891	LR: 0.065445
Training Epoch: 1 [32512/48750]	Loss: 0.3593	LR: 0.065969
Training Epoch: 1 [32768/48750]	Loss: 0.3216	LR: 0.066492
Training Epoch: 1 [33024/48750]	Loss: 0.1650	LR: 0.067016
Training Epoch: 1 [33280/48750]	Loss: 0.2114	LR: 0.067539
Training Epoch: 1 [33536/48750]	Loss: 0.3239	LR: 0.068063
Training Epoch: 1 [33792/48750]	Loss: 0.2028	LR: 0.068586
Training Epoch: 1 [34048/48750]	Loss: 0.2153	LR: 0.069110
Training Epoch: 1 [34304/48750]	Loss: 0.4126	LR: 0.069634
Training Epoch: 1 [34560/48750]	Loss: 0.2030	LR: 0.070157
Training Epoch: 1 [34816/48750]	Loss: 0.3107	LR: 0.070681
Training Epoch: 1 [35072/48750]	Loss: 0.2922	LR: 0.071204
Training Epoch: 1 [35328/48750]	Loss: 0.2246	LR: 0.071728
Training Epoch: 1 [35584/48750]	Loss: 0.3009	LR: 0.072251
Training Epoch: 1 [35840/48750]	Loss: 0.3004	LR: 0.072775
Training Epoch: 1 [36096/48750]	Loss: 0.3692	LR: 0.073298
Training Epoch: 1 [36352/48750]	Loss: 0.3072	LR: 0.073822
Training Epoch: 1 [36608/48750]	Loss: 0.2897	LR: 0.074346
Training Epoch: 1 [36864/48750]	Loss: 0.3849	LR: 0.074869
Training Epoch: 1 [37120/48750]	Loss: 0.3313	LR: 0.075393
Training Epoch: 1 [37376/48750]	Loss: 0.3375	LR: 0.075916
Training Epoch: 1 [37632/48750]	Loss: 0.3848	LR: 0.076440
Training Epoch: 1 [37888/48750]	Loss: 0.3230	LR: 0.076963
Training Epoch: 1 [38144/48750]	Loss: 0.3143	LR: 0.077487
Training Epoch: 1 [38400/48750]	Loss: 0.4075	LR: 0.078010
Training Epoch: 1 [38656/48750]	Loss: 0.3242	LR: 0.078534
Training Epoch: 1 [38912/48750]	Loss: 0.1640	LR: 0.079058
Training Epoch: 1 [39168/48750]	Loss: 0.2688	LR: 0.079581
Training Epoch: 1 [39424/48750]	Loss: 0.4633	LR: 0.080105
Training Epoch: 1 [39680/48750]	Loss: 0.4610	LR: 0.080628
Training Epoch: 1 [39936/48750]	Loss: 0.4590	LR: 0.081152
Training Epoch: 1 [40192/48750]	Loss: 0.3410	LR: 0.081675
Training Epoch: 1 [40448/48750]	Loss: 0.7435	LR: 0.082199
Training Epoch: 1 [40704/48750]	Loss: 0.5275	LR: 0.082723
Training Epoch: 1 [40960/48750]	Loss: 0.8597	LR: 0.083246
Training Epoch: 1 [41216/48750]	Loss: 0.7557	LR: 0.083770
Training Epoch: 1 [41472/48750]	Loss: 0.7347	LR: 0.084293
Training Epoch: 1 [41728/48750]	Loss: 0.5492	LR: 0.084817
Training Epoch: 1 [41984/48750]	Loss: 0.5990	LR: 0.085340
Training Epoch: 1 [42240/48750]	Loss: 0.6229	LR: 0.085864
Training Epoch: 1 [42496/48750]	Loss: 0.6834	LR: 0.086387
Training Epoch: 1 [42752/48750]	Loss: 0.6632	LR: 0.086911
Training Epoch: 1 [43008/48750]	Loss: 0.7496	LR: 0.087435
Training Epoch: 1 [43264/48750]	Loss: 0.6805	LR: 0.087958
Training Epoch: 1 [43520/48750]	Loss: 0.6204	LR: 0.088482
Training Epoch: 1 [43776/48750]	Loss: 0.6489	LR: 0.089005
Training Epoch: 1 [44032/48750]	Loss: 0.4955	LR: 0.089529
Training Epoch: 1 [44288/48750]	Loss: 0.6043	LR: 0.090052
Training Epoch: 1 [44544/48750]	Loss: 0.5130	LR: 0.090576
Training Epoch: 1 [44800/48750]	Loss: 0.4974	LR: 0.091099
Training Epoch: 1 [45056/48750]	Loss: 0.5664	LR: 0.091623
Training Epoch: 1 [45312/48750]	Loss: 0.3819	LR: 0.092147
Training Epoch: 1 [45568/48750]	Loss: 0.5911	LR: 0.092670
Training Epoch: 1 [45824/48750]	Loss: 0.4847	LR: 0.093194
Training Epoch: 1 [46080/48750]	Loss: 0.4758	LR: 0.093717
Training Epoch: 1 [46336/48750]	Loss: 0.4571	LR: 0.094241
Training Epoch: 1 [46592/48750]	Loss: 0.4322	LR: 0.094764
Training Epoch: 1 [46848/48750]	Loss: 0.3621	LR: 0.095288
Training Epoch: 1 [47104/48750]	Loss: 0.3762	LR: 0.095812
Training Epoch: 1 [47360/48750]	Loss: 0.3616	LR: 0.096335
Training Epoch: 1 [47616/48750]	Loss: 0.2870	LR: 0.096859
Training Epoch: 1 [47872/48750]	Loss: 0.5214	LR: 0.097382
Training Epoch: 1 [48128/48750]	Loss: 0.3240	LR: 0.097906
Training Epoch: 1 [48384/48750]	Loss: 0.3932	LR: 0.098429
Training Epoch: 1 [48640/48750]	Loss: 0.3466	LR: 0.098953
Training Epoch: 1 [48750/48750]	Loss: 0.2963	LR: 0.099476
Epoch 1 - Average Train Loss: 0.4132, Train Accuracy: 0.8680
Epoch 1 training time consumed: 352.63s
Evaluating Network.....
Test set: Epoch: 1, Average loss: 0.0012, Accuracy: 0.9010, Time consumed:23.47s
Saving weights file to checkpoint/retrain/ViT/Sunday_20_July_2025_13h_32m_51s/ViT-Cifar10-seed10-ret75-1-best.pth
Training Epoch: 2 [256/48750]	Loss: 0.3182	LR: 0.100000
Training Epoch: 2 [512/48750]	Loss: 0.3313	LR: 0.100000
Training Epoch: 2 [768/48750]	Loss: 0.4205	LR: 0.100000
Training Epoch: 2 [1024/48750]	Loss: 0.3754	LR: 0.100000
Training Epoch: 2 [1280/48750]	Loss: 0.5152	LR: 0.100000
Training Epoch: 2 [1536/48750]	Loss: 0.3814	LR: 0.100000
Training Epoch: 2 [1792/48750]	Loss: 0.2632	LR: 0.100000
Training Epoch: 2 [2048/48750]	Loss: 0.3789	LR: 0.100000
Training Epoch: 2 [2304/48750]	Loss: 0.4093	LR: 0.100000
Training Epoch: 2 [2560/48750]	Loss: 0.3619	LR: 0.100000
Training Epoch: 2 [2816/48750]	Loss: 0.2737	LR: 0.100000
Training Epoch: 2 [3072/48750]	Loss: 0.3756	LR: 0.100000
Training Epoch: 2 [3328/48750]	Loss: 0.2553	LR: 0.100000
Training Epoch: 2 [3584/48750]	Loss: 0.2808	LR: 0.100000
Training Epoch: 2 [3840/48750]	Loss: 0.2656	LR: 0.100000
Training Epoch: 2 [4096/48750]	Loss: 0.3595	LR: 0.100000
Training Epoch: 2 [4352/48750]	Loss: 0.3226	LR: 0.100000
Training Epoch: 2 [4608/48750]	Loss: 0.2472	LR: 0.100000
Training Epoch: 2 [4864/48750]	Loss: 0.3380	LR: 0.100000
Training Epoch: 2 [5120/48750]	Loss: 0.3168	LR: 0.100000
Training Epoch: 2 [5376/48750]	Loss: 0.2657	LR: 0.100000
Training Epoch: 2 [5632/48750]	Loss: 0.2882	LR: 0.100000
Training Epoch: 2 [5888/48750]	Loss: 0.2906	LR: 0.100000
Training Epoch: 2 [6144/48750]	Loss: 0.3068	LR: 0.100000
Training Epoch: 2 [6400/48750]	Loss: 0.3449	LR: 0.100000
Training Epoch: 2 [6656/48750]	Loss: 0.2779	LR: 0.100000
Training Epoch: 2 [6912/48750]	Loss: 0.1618	LR: 0.100000
Training Epoch: 2 [7168/48750]	Loss: 0.2694	LR: 0.100000
Training Epoch: 2 [7424/48750]	Loss: 0.2137	LR: 0.100000
Training Epoch: 2 [7680/48750]	Loss: 0.1900	LR: 0.100000
Training Epoch: 2 [7936/48750]	Loss: 0.2238	LR: 0.100000
Training Epoch: 2 [8192/48750]	Loss: 0.2778	LR: 0.100000
Training Epoch: 2 [8448/48750]	Loss: 0.2193	LR: 0.100000
Training Epoch: 2 [8704/48750]	Loss: 0.1902	LR: 0.100000
Training Epoch: 2 [8960/48750]	Loss: 0.2121	LR: 0.100000
Training Epoch: 2 [9216/48750]	Loss: 0.3472	LR: 0.100000
Training Epoch: 2 [9472/48750]	Loss: 0.1797	LR: 0.100000
Training Epoch: 2 [9728/48750]	Loss: 0.2994	LR: 0.100000
Training Epoch: 2 [9984/48750]	Loss: 0.2641	LR: 0.100000
Training Epoch: 2 [10240/48750]	Loss: 0.2237	LR: 0.100000
Training Epoch: 2 [10496/48750]	Loss: 0.1837	LR: 0.100000
Training Epoch: 2 [10752/48750]	Loss: 0.1684	LR: 0.100000
Training Epoch: 2 [11008/48750]	Loss: 0.2443	LR: 0.100000
Training Epoch: 2 [11264/48750]	Loss: 0.2064	LR: 0.100000
Training Epoch: 2 [11520/48750]	Loss: 0.1803	LR: 0.100000
Training Epoch: 2 [11776/48750]	Loss: 0.1917	LR: 0.100000
Training Epoch: 2 [12032/48750]	Loss: 0.1996	LR: 0.100000
Training Epoch: 2 [12288/48750]	Loss: 0.2071	LR: 0.100000
Training Epoch: 2 [12544/48750]	Loss: 0.1916	LR: 0.100000
Training Epoch: 2 [12800/48750]	Loss: 0.2200	LR: 0.100000
Training Epoch: 2 [13056/48750]	Loss: 0.2393	LR: 0.100000
Training Epoch: 2 [13312/48750]	Loss: 0.2303	LR: 0.100000
Training Epoch: 2 [13568/48750]	Loss: 0.2338	LR: 0.100000
Training Epoch: 2 [13824/48750]	Loss: 0.2462	LR: 0.100000
Training Epoch: 2 [14080/48750]	Loss: 0.1806	LR: 0.100000
Training Epoch: 2 [14336/48750]	Loss: 0.2307	LR: 0.100000
Training Epoch: 2 [14592/48750]	Loss: 0.1874	LR: 0.100000
Training Epoch: 2 [14848/48750]	Loss: 0.1647	LR: 0.100000
Training Epoch: 2 [15104/48750]	Loss: 0.1830	LR: 0.100000
Training Epoch: 2 [15360/48750]	Loss: 0.3719	LR: 0.100000
Training Epoch: 2 [15616/48750]	Loss: 0.1442	LR: 0.100000
Training Epoch: 2 [15872/48750]	Loss: 0.2019	LR: 0.100000
Training Epoch: 2 [16128/48750]	Loss: 0.3057	LR: 0.100000
Training Epoch: 2 [16384/48750]	Loss: 0.2788	LR: 0.100000
Training Epoch: 2 [16640/48750]	Loss: 0.3007	LR: 0.100000
Training Epoch: 2 [16896/48750]	Loss: 0.2470	LR: 0.100000
Training Epoch: 2 [17152/48750]	Loss: 0.2535	LR: 0.100000
Training Epoch: 2 [17408/48750]	Loss: 0.3084	LR: 0.100000
Training Epoch: 2 [17664/48750]	Loss: 0.2869	LR: 0.100000
Training Epoch: 2 [17920/48750]	Loss: 0.2909	LR: 0.100000
Training Epoch: 2 [18176/48750]	Loss: 0.3183	LR: 0.100000
Training Epoch: 2 [18432/48750]	Loss: 0.2385	LR: 0.100000
Training Epoch: 2 [18688/48750]	Loss: 0.2068	LR: 0.100000
Training Epoch: 2 [18944/48750]	Loss: 0.3020	LR: 0.100000
Training Epoch: 2 [19200/48750]	Loss: 0.2953	LR: 0.100000
Training Epoch: 2 [19456/48750]	Loss: 0.2662	LR: 0.100000
Training Epoch: 2 [19712/48750]	Loss: 0.3418	LR: 0.100000
Training Epoch: 2 [19968/48750]	Loss: 0.2830	LR: 0.100000
Training Epoch: 2 [20224/48750]	Loss: 0.3313	LR: 0.100000
Training Epoch: 2 [20480/48750]	Loss: 0.2815	LR: 0.100000
Training Epoch: 2 [20736/48750]	Loss: 0.1803	LR: 0.100000
Training Epoch: 2 [20992/48750]	Loss: 0.2413	LR: 0.100000
Training Epoch: 2 [21248/48750]	Loss: 0.2586	LR: 0.100000
Training Epoch: 2 [21504/48750]	Loss: 0.2394	LR: 0.100000
Training Epoch: 2 [21760/48750]	Loss: 0.2289	LR: 0.100000
Training Epoch: 2 [22016/48750]	Loss: 0.2553	LR: 0.100000
Training Epoch: 2 [22272/48750]	Loss: 0.2344	LR: 0.100000
Training Epoch: 2 [22528/48750]	Loss: 0.2075	LR: 0.100000
Training Epoch: 2 [22784/48750]	Loss: 0.2022	LR: 0.100000
Training Epoch: 2 [23040/48750]	Loss: 0.1320	LR: 0.100000
Training Epoch: 2 [23296/48750]	Loss: 0.2838	LR: 0.100000
Training Epoch: 2 [23552/48750]	Loss: 0.1665	LR: 0.100000
Training Epoch: 2 [23808/48750]	Loss: 0.2071	LR: 0.100000
Training Epoch: 2 [24064/48750]	Loss: 0.1584	LR: 0.100000
Training Epoch: 2 [24320/48750]	Loss: 0.1631	LR: 0.100000
Training Epoch: 2 [24576/48750]	Loss: 0.2670	LR: 0.100000
Training Epoch: 2 [24832/48750]	Loss: 0.2313	LR: 0.100000
Training Epoch: 2 [25088/48750]	Loss: 0.2908	LR: 0.100000
Training Epoch: 2 [25344/48750]	Loss: 0.2790	LR: 0.100000
Training Epoch: 2 [25600/48750]	Loss: 0.6144	LR: 0.100000
Training Epoch: 2 [25856/48750]	Loss: 0.2531	LR: 0.100000
Training Epoch: 2 [26112/48750]	Loss: 0.3578	LR: 0.100000
Training Epoch: 2 [26368/48750]	Loss: 0.3192	LR: 0.100000
Training Epoch: 2 [26624/48750]	Loss: 0.2548	LR: 0.100000
Training Epoch: 2 [26880/48750]	Loss: 0.3912	LR: 0.100000
Training Epoch: 2 [27136/48750]	Loss: 0.3143	LR: 0.100000
Training Epoch: 2 [27392/48750]	Loss: 0.3339	LR: 0.100000
Training Epoch: 2 [27648/48750]	Loss: 0.3410	LR: 0.100000
Training Epoch: 2 [27904/48750]	Loss: 0.2382	LR: 0.100000
Training Epoch: 2 [28160/48750]	Loss: 0.2465	LR: 0.100000
Training Epoch: 2 [28416/48750]	Loss: 0.2702	LR: 0.100000
Training Epoch: 2 [28672/48750]	Loss: 0.3077	LR: 0.100000
Training Epoch: 2 [28928/48750]	Loss: 0.3023	LR: 0.100000
Training Epoch: 2 [29184/48750]	Loss: 0.2457	LR: 0.100000
Training Epoch: 2 [29440/48750]	Loss: 0.3003	LR: 0.100000
Training Epoch: 2 [29696/48750]	Loss: 0.2907	LR: 0.100000
Training Epoch: 2 [29952/48750]	Loss: 0.1882	LR: 0.100000
Training Epoch: 2 [30208/48750]	Loss: 0.2747	LR: 0.100000
Training Epoch: 2 [30464/48750]	Loss: 0.2396	LR: 0.100000
Training Epoch: 2 [30720/48750]	Loss: 0.2995	LR: 0.100000
Training Epoch: 2 [30976/48750]	Loss: 0.4068	LR: 0.100000
Training Epoch: 2 [31232/48750]	Loss: 0.2503	LR: 0.100000
Training Epoch: 2 [31488/48750]	Loss: 0.2203	LR: 0.100000
Training Epoch: 2 [31744/48750]	Loss: 0.2523	LR: 0.100000
Training Epoch: 2 [32000/48750]	Loss: 0.2880	LR: 0.100000
Training Epoch: 2 [32256/48750]	Loss: 0.2179	LR: 0.100000
Training Epoch: 2 [32512/48750]	Loss: 0.2364	LR: 0.100000
Training Epoch: 2 [32768/48750]	Loss: 0.2542	LR: 0.100000
Training Epoch: 2 [33024/48750]	Loss: 0.2889	LR: 0.100000
Training Epoch: 2 [33280/48750]	Loss: 0.2966	LR: 0.100000
Training Epoch: 2 [33536/48750]	Loss: 0.3664	LR: 0.100000
Training Epoch: 2 [33792/48750]	Loss: 0.2110	LR: 0.100000
Training Epoch: 2 [34048/48750]	Loss: 0.2535	LR: 0.100000
Training Epoch: 2 [34304/48750]	Loss: 0.2980	LR: 0.100000
Training Epoch: 2 [34560/48750]	Loss: 0.2302	LR: 0.100000
Training Epoch: 2 [34816/48750]	Loss: 0.1755	LR: 0.100000
Training Epoch: 2 [35072/48750]	Loss: 0.2728	LR: 0.100000
Training Epoch: 2 [35328/48750]	Loss: 0.2790	LR: 0.100000
Training Epoch: 2 [35584/48750]	Loss: 0.3828	LR: 0.100000
Training Epoch: 2 [35840/48750]	Loss: 0.3627	LR: 0.100000
Training Epoch: 2 [36096/48750]	Loss: 0.2265	LR: 0.100000
Training Epoch: 2 [36352/48750]	Loss: 0.2760	LR: 0.100000
Training Epoch: 2 [36608/48750]	Loss: 0.2696	LR: 0.100000
Training Epoch: 2 [36864/48750]	Loss: 0.3126	LR: 0.100000
Training Epoch: 2 [37120/48750]	Loss: 0.3577	LR: 0.100000
Training Epoch: 2 [37376/48750]	Loss: 0.1995	LR: 0.100000
Training Epoch: 2 [37632/48750]	Loss: 0.2179	LR: 0.100000
Training Epoch: 2 [37888/48750]	Loss: 0.2413	LR: 0.100000
Training Epoch: 2 [38144/48750]	Loss: 0.2839	LR: 0.100000
Training Epoch: 2 [38400/48750]	Loss: 0.1555	LR: 0.100000
Training Epoch: 2 [38656/48750]	Loss: 0.2529	LR: 0.100000
Training Epoch: 2 [38912/48750]	Loss: 0.2655	LR: 0.100000
Training Epoch: 2 [39168/48750]	Loss: 0.2840	LR: 0.100000
Training Epoch: 2 [39424/48750]	Loss: 0.1675	LR: 0.100000
Training Epoch: 2 [39680/48750]	Loss: 0.2468	LR: 0.100000
Training Epoch: 2 [39936/48750]	Loss: 0.2364	LR: 0.100000
Training Epoch: 2 [40192/48750]	Loss: 0.2260	LR: 0.100000
Training Epoch: 2 [40448/48750]	Loss: 0.2310	LR: 0.100000
Training Epoch: 2 [40704/48750]	Loss: 0.2001	LR: 0.100000
Training Epoch: 2 [40960/48750]	Loss: 0.2481	LR: 0.100000
Training Epoch: 2 [41216/48750]	Loss: 0.1737	LR: 0.100000
Training Epoch: 2 [41472/48750]	Loss: 0.2086	LR: 0.100000
Training Epoch: 2 [41728/48750]	Loss: 0.3596	LR: 0.100000
Training Epoch: 2 [41984/48750]	Loss: 0.2111	LR: 0.100000
Training Epoch: 2 [42240/48750]	Loss: 0.2447	LR: 0.100000
Training Epoch: 2 [42496/48750]	Loss: 0.2362	LR: 0.100000
Training Epoch: 2 [42752/48750]	Loss: 0.1655	LR: 0.100000
Training Epoch: 2 [43008/48750]	Loss: 0.2919	LR: 0.100000
Training Epoch: 2 [43264/48750]	Loss: 0.3188	LR: 0.100000
Training Epoch: 2 [43520/48750]	Loss: 0.2349	LR: 0.100000
Training Epoch: 2 [43776/48750]	Loss: 0.1966	LR: 0.100000
Training Epoch: 2 [44032/48750]	Loss: 0.1812	LR: 0.100000
Training Epoch: 2 [44288/48750]	Loss: 0.2923	LR: 0.100000
Training Epoch: 2 [44544/48750]	Loss: 0.3020	LR: 0.100000
Training Epoch: 2 [44800/48750]	Loss: 0.1768	LR: 0.100000
Training Epoch: 2 [45056/48750]	Loss: 0.2144	LR: 0.100000
Training Epoch: 2 [45312/48750]	Loss: 0.2137	LR: 0.100000
Training Epoch: 2 [45568/48750]	Loss: 0.1324	LR: 0.100000
Training Epoch: 2 [45824/48750]	Loss: 0.1807	LR: 0.100000
Training Epoch: 2 [46080/48750]	Loss: 0.2773	LR: 0.100000
Training Epoch: 2 [46336/48750]	Loss: 0.1992	LR: 0.100000
Training Epoch: 2 [46592/48750]	Loss: 0.1898	LR: 0.100000
Training Epoch: 2 [46848/48750]	Loss: 0.1797	LR: 0.100000
Training Epoch: 2 [47104/48750]	Loss: 0.2404	LR: 0.100000
Training Epoch: 2 [47360/48750]	Loss: 0.1984	LR: 0.100000
Training Epoch: 2 [47616/48750]	Loss: 0.2200	LR: 0.100000
Training Epoch: 2 [47872/48750]	Loss: 0.2102	LR: 0.100000
Training Epoch: 2 [48128/48750]	Loss: 0.2303	LR: 0.100000
Training Epoch: 2 [48384/48750]	Loss: 0.1832	LR: 0.100000
Training Epoch: 2 [48640/48750]	Loss: 0.1422	LR: 0.100000
Training Epoch: 2 [48750/48750]	Loss: 0.1877	LR: 0.100000
Epoch 2 - Average Train Loss: 0.2590, Train Accuracy: 0.9127
Epoch 2 training time consumed: 351.74s
Evaluating Network.....
Test set: Epoch: 2, Average loss: 0.0005, Accuracy: 0.9552, Time consumed:23.46s
Saving weights file to checkpoint/retrain/ViT/Sunday_20_July_2025_13h_32m_51s/ViT-Cifar10-seed10-ret75-2-best.pth
Training Epoch: 3 [256/48750]	Loss: 0.1353	LR: 0.100000
Training Epoch: 3 [512/48750]	Loss: 0.1872	LR: 0.100000
Training Epoch: 3 [768/48750]	Loss: 0.1142	LR: 0.100000
Training Epoch: 3 [1024/48750]	Loss: 0.2519	LR: 0.100000
Training Epoch: 3 [1280/48750]	Loss: 0.1836	LR: 0.100000
Training Epoch: 3 [1536/48750]	Loss: 0.1218	LR: 0.100000
Training Epoch: 3 [1792/48750]	Loss: 0.1592	LR: 0.100000
Training Epoch: 3 [2048/48750]	Loss: 0.1465	LR: 0.100000
Training Epoch: 3 [2304/48750]	Loss: 0.1941	LR: 0.100000
Training Epoch: 3 [2560/48750]	Loss: 0.0971	LR: 0.100000
Training Epoch: 3 [2816/48750]	Loss: 0.1140	LR: 0.100000
Training Epoch: 3 [3072/48750]	Loss: 0.1939	LR: 0.100000
Training Epoch: 3 [3328/48750]	Loss: 0.1418	LR: 0.100000
Training Epoch: 3 [3584/48750]	Loss: 0.1467	LR: 0.100000
Training Epoch: 3 [3840/48750]	Loss: 0.1622	LR: 0.100000
Training Epoch: 3 [4096/48750]	Loss: 0.1630	LR: 0.100000
Training Epoch: 3 [4352/48750]	Loss: 0.2053	LR: 0.100000
Training Epoch: 3 [4608/48750]	Loss: 0.1932	LR: 0.100000
Training Epoch: 3 [4864/48750]	Loss: 0.1054	LR: 0.100000
Training Epoch: 3 [5120/48750]	Loss: 0.1286	LR: 0.100000
Training Epoch: 3 [5376/48750]	Loss: 0.1182	LR: 0.100000
Training Epoch: 3 [5632/48750]	Loss: 0.1475	LR: 0.100000
Training Epoch: 3 [5888/48750]	Loss: 0.1060	LR: 0.100000
Training Epoch: 3 [6144/48750]	Loss: 0.1757	LR: 0.100000
Training Epoch: 3 [6400/48750]	Loss: 0.1505	LR: 0.100000
Training Epoch: 3 [6656/48750]	Loss: 0.1507	LR: 0.100000
Training Epoch: 3 [6912/48750]	Loss: 0.1695	LR: 0.100000
Training Epoch: 3 [7168/48750]	Loss: 0.2112	LR: 0.100000
Training Epoch: 3 [7424/48750]	Loss: 0.1283	LR: 0.100000
Training Epoch: 3 [7680/48750]	Loss: 0.2073	LR: 0.100000
Training Epoch: 3 [7936/48750]	Loss: 0.1546	LR: 0.100000
Training Epoch: 3 [8192/48750]	Loss: 0.2156	LR: 0.100000
Training Epoch: 3 [8448/48750]	Loss: 0.1071	LR: 0.100000
Training Epoch: 3 [8704/48750]	Loss: 0.1031	LR: 0.100000
Training Epoch: 3 [8960/48750]	Loss: 0.1598	LR: 0.100000
Training Epoch: 3 [9216/48750]	Loss: 0.1425	LR: 0.100000
Training Epoch: 3 [9472/48750]	Loss: 0.1282	LR: 0.100000
Training Epoch: 3 [9728/48750]	Loss: 0.1195	LR: 0.100000
Training Epoch: 3 [9984/48750]	Loss: 0.1987	LR: 0.100000
Training Epoch: 3 [10240/48750]	Loss: 0.1571	LR: 0.100000
Training Epoch: 3 [10496/48750]	Loss: 0.1488	LR: 0.100000
Training Epoch: 3 [10752/48750]	Loss: 0.1764	LR: 0.100000
Training Epoch: 3 [11008/48750]	Loss: 0.2103	LR: 0.100000
Training Epoch: 3 [11264/48750]	Loss: 0.1076	LR: 0.100000
Training Epoch: 3 [11520/48750]	Loss: 0.1751	LR: 0.100000
Training Epoch: 3 [11776/48750]	Loss: 0.1624	LR: 0.100000
Training Epoch: 3 [12032/48750]	Loss: 0.1032	LR: 0.100000
Training Epoch: 3 [12288/48750]	Loss: 0.1434	LR: 0.100000
Training Epoch: 3 [12544/48750]	Loss: 0.0891	LR: 0.100000
Training Epoch: 3 [12800/48750]	Loss: 0.1112	LR: 0.100000
Training Epoch: 3 [13056/48750]	Loss: 0.1233	LR: 0.100000
Training Epoch: 3 [13312/48750]	Loss: 0.1465	LR: 0.100000
Training Epoch: 3 [13568/48750]	Loss: 0.1470	LR: 0.100000
Training Epoch: 3 [13824/48750]	Loss: 0.2262	LR: 0.100000
Training Epoch: 3 [14080/48750]	Loss: 0.1654	LR: 0.100000
Training Epoch: 3 [14336/48750]	Loss: 0.1638	LR: 0.100000
Training Epoch: 3 [14592/48750]	Loss: 0.2500	LR: 0.100000
Training Epoch: 3 [14848/48750]	Loss: 0.1401	LR: 0.100000
Training Epoch: 3 [15104/48750]	Loss: 0.1810	LR: 0.100000
Training Epoch: 3 [15360/48750]	Loss: 0.1084	LR: 0.100000
Training Epoch: 3 [15616/48750]	Loss: 0.2122	LR: 0.100000
Training Epoch: 3 [15872/48750]	Loss: 0.1323	LR: 0.100000
Training Epoch: 3 [16128/48750]	Loss: 0.1224	LR: 0.100000
Training Epoch: 3 [16384/48750]	Loss: 0.1036	LR: 0.100000
Training Epoch: 3 [16640/48750]	Loss: 0.1702	LR: 0.100000
Training Epoch: 3 [16896/48750]	Loss: 0.1436	LR: 0.100000
Training Epoch: 3 [17152/48750]	Loss: 0.1893	LR: 0.100000
Training Epoch: 3 [17408/48750]	Loss: 0.1025	LR: 0.100000
Training Epoch: 3 [17664/48750]	Loss: 0.1900	LR: 0.100000
Training Epoch: 3 [17920/48750]	Loss: 0.2113	LR: 0.100000
Training Epoch: 3 [18176/48750]	Loss: 0.1685	LR: 0.100000
Training Epoch: 3 [18432/48750]	Loss: 0.2113	LR: 0.100000
Training Epoch: 3 [18688/48750]	Loss: 0.1955	LR: 0.100000
Training Epoch: 3 [18944/48750]	Loss: 0.1565	LR: 0.100000
Training Epoch: 3 [19200/48750]	Loss: 0.1281	LR: 0.100000
Training Epoch: 3 [19456/48750]	Loss: 0.1684	LR: 0.100000
Training Epoch: 3 [19712/48750]	Loss: 0.1739	LR: 0.100000
Training Epoch: 3 [19968/48750]	Loss: 0.1169	LR: 0.100000
Training Epoch: 3 [20224/48750]	Loss: 0.0895	LR: 0.100000
Training Epoch: 3 [20480/48750]	Loss: 0.1590	LR: 0.100000
Training Epoch: 3 [20736/48750]	Loss: 0.2169	LR: 0.100000
Training Epoch: 3 [20992/48750]	Loss: 0.1169	LR: 0.100000
Training Epoch: 3 [21248/48750]	Loss: 0.1274	LR: 0.100000
Training Epoch: 3 [21504/48750]	Loss: 0.1749	LR: 0.100000
Training Epoch: 3 [21760/48750]	Loss: 0.1737	LR: 0.100000
Training Epoch: 3 [22016/48750]	Loss: 0.1452	LR: 0.100000
Training Epoch: 3 [22272/48750]	Loss: 0.1312	LR: 0.100000
Training Epoch: 3 [22528/48750]	Loss: 0.1331	LR: 0.100000
Training Epoch: 3 [22784/48750]	Loss: 0.1459	LR: 0.100000
Training Epoch: 3 [23040/48750]	Loss: 0.1851	LR: 0.100000
Training Epoch: 3 [23296/48750]	Loss: 0.1400	LR: 0.100000
Training Epoch: 3 [23552/48750]	Loss: 0.1716	LR: 0.100000
Training Epoch: 3 [23808/48750]	Loss: 0.1397	LR: 0.100000
Training Epoch: 3 [24064/48750]	Loss: 0.1324	LR: 0.100000
Training Epoch: 3 [24320/48750]	Loss: 0.1092	LR: 0.100000
Training Epoch: 3 [24576/48750]	Loss: 0.1035	LR: 0.100000
Training Epoch: 3 [24832/48750]	Loss: 0.2043	LR: 0.100000
Training Epoch: 3 [25088/48750]	Loss: 0.1300	LR: 0.100000
Training Epoch: 3 [25344/48750]	Loss: 0.1010	LR: 0.100000
Training Epoch: 3 [25600/48750]	Loss: 0.0532	LR: 0.100000
Training Epoch: 3 [25856/48750]	Loss: 0.1344	LR: 0.100000
Training Epoch: 3 [26112/48750]	Loss: 0.1691	LR: 0.100000
Training Epoch: 3 [26368/48750]	Loss: 0.0811	LR: 0.100000
Training Epoch: 3 [26624/48750]	Loss: 0.1508	LR: 0.100000
Training Epoch: 3 [26880/48750]	Loss: 0.0801	LR: 0.100000
Training Epoch: 3 [27136/48750]	Loss: 0.0901	LR: 0.100000
Training Epoch: 3 [27392/48750]	Loss: 0.1241	LR: 0.100000
Training Epoch: 3 [27648/48750]	Loss: 0.1507	LR: 0.100000
Training Epoch: 3 [27904/48750]	Loss: 0.1540	LR: 0.100000
Training Epoch: 3 [28160/48750]	Loss: 0.1482	LR: 0.100000
Training Epoch: 3 [28416/48750]	Loss: 0.1480	LR: 0.100000
Training Epoch: 3 [28672/48750]	Loss: 0.1421	LR: 0.100000
Training Epoch: 3 [28928/48750]	Loss: 0.2166	LR: 0.100000
Training Epoch: 3 [29184/48750]	Loss: 0.2142	LR: 0.100000
Training Epoch: 3 [29440/48750]	Loss: 0.1398	LR: 0.100000
Training Epoch: 3 [29696/48750]	Loss: 0.1816	LR: 0.100000
Training Epoch: 3 [29952/48750]	Loss: 0.2058	LR: 0.100000
Training Epoch: 3 [30208/48750]	Loss: 0.1591	LR: 0.100000
Training Epoch: 3 [30464/48750]	Loss: 0.1410	LR: 0.100000
Training Epoch: 3 [30720/48750]	Loss: 0.1401	LR: 0.100000
Training Epoch: 3 [30976/48750]	Loss: 0.1441	LR: 0.100000
Training Epoch: 3 [31232/48750]	Loss: 0.1933	LR: 0.100000
Training Epoch: 3 [31488/48750]	Loss: 0.1167	LR: 0.100000
Training Epoch: 3 [31744/48750]	Loss: 0.1270	LR: 0.100000
Training Epoch: 3 [32000/48750]	Loss: 0.1502	LR: 0.100000
Training Epoch: 3 [32256/48750]	Loss: 0.1962	LR: 0.100000
Training Epoch: 3 [32512/48750]	Loss: 0.1178	LR: 0.100000
Training Epoch: 3 [32768/48750]	Loss: 0.1175	LR: 0.100000
Training Epoch: 3 [33024/48750]	Loss: 0.1409	LR: 0.100000
Training Epoch: 3 [33280/48750]	Loss: 0.1639	LR: 0.100000
Training Epoch: 3 [33536/48750]	Loss: 0.1803	LR: 0.100000
Training Epoch: 3 [33792/48750]	Loss: 0.1266	LR: 0.100000
Training Epoch: 3 [34048/48750]	Loss: 0.1566	LR: 0.100000
Training Epoch: 3 [34304/48750]	Loss: 0.2007	LR: 0.100000
Training Epoch: 3 [34560/48750]	Loss: 0.1037	LR: 0.100000
Training Epoch: 3 [34816/48750]	Loss: 0.1295	LR: 0.100000
Training Epoch: 3 [35072/48750]	Loss: 0.1674	LR: 0.100000
Training Epoch: 3 [35328/48750]	Loss: 0.1818	LR: 0.100000
Training Epoch: 3 [35584/48750]	Loss: 0.1946	LR: 0.100000
Training Epoch: 3 [35840/48750]	Loss: 0.1835	LR: 0.100000
Training Epoch: 3 [36096/48750]	Loss: 0.1827	LR: 0.100000
Training Epoch: 3 [36352/48750]	Loss: 0.1320	LR: 0.100000
Training Epoch: 3 [36608/48750]	Loss: 0.1847	LR: 0.100000
Training Epoch: 3 [36864/48750]	Loss: 0.1034	LR: 0.100000
Training Epoch: 3 [37120/48750]	Loss: 0.1325	LR: 0.100000
Training Epoch: 3 [37376/48750]	Loss: 0.2572	LR: 0.100000
Training Epoch: 3 [37632/48750]	Loss: 0.1575	LR: 0.100000
Training Epoch: 3 [37888/48750]	Loss: 0.2215	LR: 0.100000
Training Epoch: 3 [38144/48750]	Loss: 0.1340	LR: 0.100000
Training Epoch: 3 [38400/48750]	Loss: 0.2548	LR: 0.100000
Training Epoch: 3 [38656/48750]	Loss: 0.2299	LR: 0.100000
Training Epoch: 3 [38912/48750]	Loss: 0.1714	LR: 0.100000
Training Epoch: 3 [39168/48750]	Loss: 0.1545	LR: 0.100000
Training Epoch: 3 [39424/48750]	Loss: 0.1565	LR: 0.100000
Training Epoch: 3 [39680/48750]	Loss: 0.2296	LR: 0.100000
Training Epoch: 3 [39936/48750]	Loss: 0.1941	LR: 0.100000
Training Epoch: 3 [40192/48750]	Loss: 0.2545	LR: 0.100000
Training Epoch: 3 [40448/48750]	Loss: 0.1961	LR: 0.100000
Training Epoch: 3 [40704/48750]	Loss: 0.1383	LR: 0.100000
Training Epoch: 3 [40960/48750]	Loss: 0.1811	LR: 0.100000
Training Epoch: 3 [41216/48750]	Loss: 0.2151	LR: 0.100000
Training Epoch: 3 [41472/48750]	Loss: 0.2137	LR: 0.100000
Training Epoch: 3 [41728/48750]	Loss: 0.1382	LR: 0.100000
Training Epoch: 3 [41984/48750]	Loss: 0.1081	LR: 0.100000
Training Epoch: 3 [42240/48750]	Loss: 0.1577	LR: 0.100000
Training Epoch: 3 [42496/48750]	Loss: 0.1565	LR: 0.100000
Training Epoch: 3 [42752/48750]	Loss: 0.1490	LR: 0.100000
Training Epoch: 3 [43008/48750]	Loss: 0.1531	LR: 0.100000
Training Epoch: 3 [43264/48750]	Loss: 0.1892	LR: 0.100000
Training Epoch: 3 [43520/48750]	Loss: 0.1488	LR: 0.100000
Training Epoch: 3 [43776/48750]	Loss: 0.1335	LR: 0.100000
Training Epoch: 3 [44032/48750]	Loss: 0.1425	LR: 0.100000
Training Epoch: 3 [44288/48750]	Loss: 0.0918	LR: 0.100000
Training Epoch: 3 [44544/48750]	Loss: 0.1260	LR: 0.100000
Training Epoch: 3 [44800/48750]	Loss: 0.2231	LR: 0.100000
Training Epoch: 3 [45056/48750]	Loss: 0.1821	LR: 0.100000
Training Epoch: 3 [45312/48750]	Loss: 0.0790	LR: 0.100000
Training Epoch: 3 [45568/48750]	Loss: 0.2451	LR: 0.100000
Training Epoch: 3 [45824/48750]	Loss: 0.1361	LR: 0.100000
Training Epoch: 3 [46080/48750]	Loss: 0.1445	LR: 0.100000
Training Epoch: 3 [46336/48750]	Loss: 0.1895	LR: 0.100000
Training Epoch: 3 [46592/48750]	Loss: 0.1547	LR: 0.100000
Training Epoch: 3 [46848/48750]	Loss: 0.1515	LR: 0.100000
Training Epoch: 3 [47104/48750]	Loss: 0.1878	LR: 0.100000
Training Epoch: 3 [47360/48750]	Loss: 0.1088	LR: 0.100000
Training Epoch: 3 [47616/48750]	Loss: 0.1894	LR: 0.100000
Training Epoch: 3 [47872/48750]	Loss: 0.1702	LR: 0.100000
Training Epoch: 3 [48128/48750]	Loss: 0.1339	LR: 0.100000
Training Epoch: 3 [48384/48750]	Loss: 0.1638	LR: 0.100000
Training Epoch: 3 [48640/48750]	Loss: 0.1409	LR: 0.100000
Training Epoch: 3 [48750/48750]	Loss: 0.0890	LR: 0.100000
Epoch 3 - Average Train Loss: 0.1562, Train Accuracy: 0.9471
Epoch 3 training time consumed: 351.90s
Evaluating Network.....
Test set: Epoch: 3, Average loss: 0.0005, Accuracy: 0.9622, Time consumed:23.46s
Saving weights file to checkpoint/retrain/ViT/Sunday_20_July_2025_13h_32m_51s/ViT-Cifar10-seed10-ret75-3-best.pth
Training Epoch: 4 [256/48750]	Loss: 0.1182	LR: 0.100000
Training Epoch: 4 [512/48750]	Loss: 0.1331	LR: 0.100000
Training Epoch: 4 [768/48750]	Loss: 0.0769	LR: 0.100000
Training Epoch: 4 [1024/48750]	Loss: 0.1777	LR: 0.100000
Training Epoch: 4 [1280/48750]	Loss: 0.0825	LR: 0.100000
Training Epoch: 4 [1536/48750]	Loss: 0.1077	LR: 0.100000
Training Epoch: 4 [1792/48750]	Loss: 0.1168	LR: 0.100000
Training Epoch: 4 [2048/48750]	Loss: 0.1300	LR: 0.100000
Training Epoch: 4 [2304/48750]	Loss: 0.0937	LR: 0.100000
Training Epoch: 4 [2560/48750]	Loss: 0.0717	LR: 0.100000
Training Epoch: 4 [2816/48750]	Loss: 0.0959	LR: 0.100000
Training Epoch: 4 [3072/48750]	Loss: 0.1170	LR: 0.100000
Training Epoch: 4 [3328/48750]	Loss: 0.0729	LR: 0.100000
Training Epoch: 4 [3584/48750]	Loss: 0.0922	LR: 0.100000
Training Epoch: 4 [3840/48750]	Loss: 0.1306	LR: 0.100000
Training Epoch: 4 [4096/48750]	Loss: 0.1050	LR: 0.100000
Training Epoch: 4 [4352/48750]	Loss: 0.1383	LR: 0.100000
Training Epoch: 4 [4608/48750]	Loss: 0.1493	LR: 0.100000
Training Epoch: 4 [4864/48750]	Loss: 0.0829	LR: 0.100000
Training Epoch: 4 [5120/48750]	Loss: 0.1345	LR: 0.100000
Training Epoch: 4 [5376/48750]	Loss: 0.1034	LR: 0.100000
Training Epoch: 4 [5632/48750]	Loss: 0.1338	LR: 0.100000
Training Epoch: 4 [5888/48750]	Loss: 0.1081	LR: 0.100000
Training Epoch: 4 [6144/48750]	Loss: 0.0825	LR: 0.100000
Training Epoch: 4 [6400/48750]	Loss: 0.1835	LR: 0.100000
Training Epoch: 4 [6656/48750]	Loss: 0.1270	LR: 0.100000
Training Epoch: 4 [6912/48750]	Loss: 0.1723	LR: 0.100000
Training Epoch: 4 [7168/48750]	Loss: 0.1206	LR: 0.100000
Training Epoch: 4 [7424/48750]	Loss: 0.1900	LR: 0.100000
Training Epoch: 4 [7680/48750]	Loss: 0.2053	LR: 0.100000
Training Epoch: 4 [7936/48750]	Loss: 0.1855	LR: 0.100000
Training Epoch: 4 [8192/48750]	Loss: 0.1935	LR: 0.100000
Training Epoch: 4 [8448/48750]	Loss: 0.1301	LR: 0.100000
Training Epoch: 4 [8704/48750]	Loss: 0.1180	LR: 0.100000
Training Epoch: 4 [8960/48750]	Loss: 0.1265	LR: 0.100000
Training Epoch: 4 [9216/48750]	Loss: 0.1345	LR: 0.100000
Training Epoch: 4 [9472/48750]	Loss: 0.0543	LR: 0.100000
Training Epoch: 4 [9728/48750]	Loss: 0.1798	LR: 0.100000
Training Epoch: 4 [9984/48750]	Loss: 0.1458	LR: 0.100000
Training Epoch: 4 [10240/48750]	Loss: 0.1230	LR: 0.100000
Training Epoch: 4 [10496/48750]	Loss: 0.1143	LR: 0.100000
Training Epoch: 4 [10752/48750]	Loss: 0.1469	LR: 0.100000
Training Epoch: 4 [11008/48750]	Loss: 0.1249	LR: 0.100000
Training Epoch: 4 [11264/48750]	Loss: 0.1505	LR: 0.100000
Training Epoch: 4 [11520/48750]	Loss: 0.1418	LR: 0.100000
Training Epoch: 4 [11776/48750]	Loss: 0.2069	LR: 0.100000
Training Epoch: 4 [12032/48750]	Loss: 0.1013	LR: 0.100000
Training Epoch: 4 [12288/48750]	Loss: 0.1000	LR: 0.100000
Training Epoch: 4 [12544/48750]	Loss: 0.1336	LR: 0.100000
Training Epoch: 4 [12800/48750]	Loss: 0.1315	LR: 0.100000
Training Epoch: 4 [13056/48750]	Loss: 0.0860	LR: 0.100000
Training Epoch: 4 [13312/48750]	Loss: 0.1185	LR: 0.100000
Training Epoch: 4 [13568/48750]	Loss: 0.2381	LR: 0.100000
Training Epoch: 4 [13824/48750]	Loss: 0.1576	LR: 0.100000
Training Epoch: 4 [14080/48750]	Loss: 0.1315	LR: 0.100000
Training Epoch: 4 [14336/48750]	Loss: 0.0804	LR: 0.100000
Training Epoch: 4 [14592/48750]	Loss: 0.1463	LR: 0.100000
Training Epoch: 4 [14848/48750]	Loss: 0.1140	LR: 0.100000
Training Epoch: 4 [15104/48750]	Loss: 0.1263	LR: 0.100000
Training Epoch: 4 [15360/48750]	Loss: 0.1016	LR: 0.100000
Training Epoch: 4 [15616/48750]	Loss: 0.1439	LR: 0.100000
Training Epoch: 4 [15872/48750]	Loss: 0.1280	LR: 0.100000
Training Epoch: 4 [16128/48750]	Loss: 0.1511	LR: 0.100000
Training Epoch: 4 [16384/48750]	Loss: 0.1757	LR: 0.100000
Training Epoch: 4 [16640/48750]	Loss: 0.1709	LR: 0.100000
Training Epoch: 4 [16896/48750]	Loss: 0.1762	LR: 0.100000
Training Epoch: 4 [17152/48750]	Loss: 0.0812	LR: 0.100000
Training Epoch: 4 [17408/48750]	Loss: 0.2118	LR: 0.100000
Training Epoch: 4 [17664/48750]	Loss: 0.1838	LR: 0.100000
Training Epoch: 4 [17920/48750]	Loss: 0.1291	LR: 0.100000
Training Epoch: 4 [18176/48750]	Loss: 0.1744	LR: 0.100000
Training Epoch: 4 [18432/48750]	Loss: 0.1724	LR: 0.100000
Training Epoch: 4 [18688/48750]	Loss: 0.1610	LR: 0.100000
Training Epoch: 4 [18944/48750]	Loss: 0.1813	LR: 0.100000
Training Epoch: 4 [19200/48750]	Loss: 0.2959	LR: 0.100000
Training Epoch: 4 [19456/48750]	Loss: 0.1244	LR: 0.100000
Training Epoch: 4 [19712/48750]	Loss: 0.1668	LR: 0.100000
Training Epoch: 4 [19968/48750]	Loss: 0.1035	LR: 0.100000
Training Epoch: 4 [20224/48750]	Loss: 0.2550	LR: 0.100000
Training Epoch: 4 [20480/48750]	Loss: 0.0869	LR: 0.100000
Training Epoch: 4 [20736/48750]	Loss: 0.1709	LR: 0.100000
Training Epoch: 4 [20992/48750]	Loss: 0.2228	LR: 0.100000
Training Epoch: 4 [21248/48750]	Loss: 0.1256	LR: 0.100000
Training Epoch: 4 [21504/48750]	Loss: 0.1953	LR: 0.100000
Training Epoch: 4 [21760/48750]	Loss: 0.1282	LR: 0.100000
Training Epoch: 4 [22016/48750]	Loss: 0.1299	LR: 0.100000
Training Epoch: 4 [22272/48750]	Loss: 0.1681	LR: 0.100000
Training Epoch: 4 [22528/48750]	Loss: 0.2045	LR: 0.100000
Training Epoch: 4 [22784/48750]	Loss: 0.1327	LR: 0.100000
Training Epoch: 4 [23040/48750]	Loss: 0.1539	LR: 0.100000
Training Epoch: 4 [23296/48750]	Loss: 0.1085	LR: 0.100000
Training Epoch: 4 [23552/48750]	Loss: 0.1330	LR: 0.100000
Training Epoch: 4 [23808/48750]	Loss: 0.1373	LR: 0.100000
Training Epoch: 4 [24064/48750]	Loss: 0.1202	LR: 0.100000
Training Epoch: 4 [24320/48750]	Loss: 0.1774	LR: 0.100000
Training Epoch: 4 [24576/48750]	Loss: 0.1085	LR: 0.100000
Training Epoch: 4 [24832/48750]	Loss: 0.1745	LR: 0.100000
Training Epoch: 4 [25088/48750]	Loss: 0.0870	LR: 0.100000
Training Epoch: 4 [25344/48750]	Loss: 0.1617	LR: 0.100000
Training Epoch: 4 [25600/48750]	Loss: 0.1273	LR: 0.100000
Training Epoch: 4 [25856/48750]	Loss: 0.1928	LR: 0.100000
Training Epoch: 4 [26112/48750]	Loss: 0.1070	LR: 0.100000
Training Epoch: 4 [26368/48750]	Loss: 0.1472	LR: 0.100000
Training Epoch: 4 [26624/48750]	Loss: 0.2026	LR: 0.100000
Training Epoch: 4 [26880/48750]	Loss: 0.1805	LR: 0.100000
Training Epoch: 4 [27136/48750]	Loss: 0.1282	LR: 0.100000
Training Epoch: 4 [27392/48750]	Loss: 0.1285	LR: 0.100000
Training Epoch: 4 [27648/48750]	Loss: 0.1704	LR: 0.100000
Training Epoch: 4 [27904/48750]	Loss: 0.1474	LR: 0.100000
Training Epoch: 4 [28160/48750]	Loss: 0.1259	LR: 0.100000
Training Epoch: 4 [28416/48750]	Loss: 0.1796	LR: 0.100000
Training Epoch: 4 [28672/48750]	Loss: 0.1938	LR: 0.100000
Training Epoch: 4 [28928/48750]	Loss: 0.1258	LR: 0.100000
Training Epoch: 4 [29184/48750]	Loss: 0.1170	LR: 0.100000
Training Epoch: 4 [29440/48750]	Loss: 0.1829	LR: 0.100000
Training Epoch: 4 [29696/48750]	Loss: 0.2598	LR: 0.100000
Training Epoch: 4 [29952/48750]	Loss: 0.1059	LR: 0.100000
Training Epoch: 4 [30208/48750]	Loss: 0.1933	LR: 0.100000
Training Epoch: 4 [30464/48750]	Loss: 0.2626	LR: 0.100000
Training Epoch: 4 [30720/48750]	Loss: 0.1475	LR: 0.100000
Training Epoch: 4 [30976/48750]	Loss: 0.1352	LR: 0.100000
Training Epoch: 4 [31232/48750]	Loss: 0.2317	LR: 0.100000
Training Epoch: 4 [31488/48750]	Loss: 0.1845	LR: 0.100000
Training Epoch: 4 [31744/48750]	Loss: 0.1130	LR: 0.100000
Training Epoch: 4 [32000/48750]	Loss: 0.1235	LR: 0.100000
Training Epoch: 4 [32256/48750]	Loss: 0.1157	LR: 0.100000
Training Epoch: 4 [32512/48750]	Loss: 0.0904	LR: 0.100000
Training Epoch: 4 [32768/48750]	Loss: 0.1719	LR: 0.100000
Training Epoch: 4 [33024/48750]	Loss: 0.0952	LR: 0.100000
Training Epoch: 4 [33280/48750]	Loss: 0.2317	LR: 0.100000
Training Epoch: 4 [33536/48750]	Loss: 0.1039	LR: 0.100000
Training Epoch: 4 [33792/48750]	Loss: 0.1316	LR: 0.100000
Training Epoch: 4 [34048/48750]	Loss: 0.1523	LR: 0.100000
Training Epoch: 4 [34304/48750]	Loss: 0.0927	LR: 0.100000
Training Epoch: 4 [34560/48750]	Loss: 0.1704	LR: 0.100000
Training Epoch: 4 [34816/48750]	Loss: 0.0990	LR: 0.100000
Training Epoch: 4 [35072/48750]	Loss: 0.1511	LR: 0.100000
Training Epoch: 4 [35328/48750]	Loss: 0.1314	LR: 0.100000
Training Epoch: 4 [35584/48750]	Loss: 0.1637	LR: 0.100000
Training Epoch: 4 [35840/48750]	Loss: 0.0841	LR: 0.100000
Training Epoch: 4 [36096/48750]	Loss: 0.1152	LR: 0.100000
Training Epoch: 4 [36352/48750]	Loss: 0.2185	LR: 0.100000
Training Epoch: 4 [36608/48750]	Loss: 0.1337	LR: 0.100000
Training Epoch: 4 [36864/48750]	Loss: 0.1677	LR: 0.100000
Training Epoch: 4 [37120/48750]	Loss: 0.1285	LR: 0.100000
Training Epoch: 4 [37376/48750]	Loss: 0.1418	LR: 0.100000
Training Epoch: 4 [37632/48750]	Loss: 0.0935	LR: 0.100000
Training Epoch: 4 [37888/48750]	Loss: 0.1491	LR: 0.100000
Training Epoch: 4 [38144/48750]	Loss: 0.1570	LR: 0.100000
Training Epoch: 4 [38400/48750]	Loss: 0.1729	LR: 0.100000
Training Epoch: 4 [38656/48750]	Loss: 0.1568	LR: 0.100000
Training Epoch: 4 [38912/48750]	Loss: 0.1185	LR: 0.100000
Training Epoch: 4 [39168/48750]	Loss: 0.1109	LR: 0.100000
Training Epoch: 4 [39424/48750]	Loss: 0.1758	LR: 0.100000
Training Epoch: 4 [39680/48750]	Loss: 0.1065	LR: 0.100000
Training Epoch: 4 [39936/48750]	Loss: 0.1095	LR: 0.100000
Training Epoch: 4 [40192/48750]	Loss: 0.1192	LR: 0.100000
Training Epoch: 4 [40448/48750]	Loss: 0.1682	LR: 0.100000
Training Epoch: 4 [40704/48750]	Loss: 0.1647	LR: 0.100000
Training Epoch: 4 [40960/48750]	Loss: 0.1356	LR: 0.100000
Training Epoch: 4 [41216/48750]	Loss: 0.1411	LR: 0.100000
Training Epoch: 4 [41472/48750]	Loss: 0.1919	LR: 0.100000
Training Epoch: 4 [41728/48750]	Loss: 0.1869	LR: 0.100000
Training Epoch: 4 [41984/48750]	Loss: 0.0983	LR: 0.100000
Training Epoch: 4 [42240/48750]	Loss: 0.1245	LR: 0.100000
Training Epoch: 4 [42496/48750]	Loss: 0.0949	LR: 0.100000
Training Epoch: 4 [42752/48750]	Loss: 0.1102	LR: 0.100000
Training Epoch: 4 [43008/48750]	Loss: 0.1436	LR: 0.100000
Training Epoch: 4 [43264/48750]	Loss: 0.1342	LR: 0.100000
Training Epoch: 4 [43520/48750]	Loss: 0.1676	LR: 0.100000
Training Epoch: 4 [43776/48750]	Loss: 0.1566	LR: 0.100000
Training Epoch: 4 [44032/48750]	Loss: 0.2191	LR: 0.100000
Training Epoch: 4 [44288/48750]	Loss: 0.1786	LR: 0.100000
Training Epoch: 4 [44544/48750]	Loss: 0.1467	LR: 0.100000
Training Epoch: 4 [44800/48750]	Loss: 0.2159	LR: 0.100000
Training Epoch: 4 [45056/48750]	Loss: 0.1005	LR: 0.100000
Training Epoch: 4 [45312/48750]	Loss: 0.0986	LR: 0.100000
Training Epoch: 4 [45568/48750]	Loss: 0.1046	LR: 0.100000
Training Epoch: 4 [45824/48750]	Loss: 0.1329	LR: 0.100000
Training Epoch: 4 [46080/48750]	Loss: 0.1159	LR: 0.100000
Training Epoch: 4 [46336/48750]	Loss: 0.1896	LR: 0.100000
Training Epoch: 4 [46592/48750]	Loss: 0.1270	LR: 0.100000
Training Epoch: 4 [46848/48750]	Loss: 0.1245	LR: 0.100000
Training Epoch: 4 [47104/48750]	Loss: 0.2097	LR: 0.100000
Training Epoch: 4 [47360/48750]	Loss: 0.1235	LR: 0.100000
Training Epoch: 4 [47616/48750]	Loss: 0.1300	LR: 0.100000
Training Epoch: 4 [47872/48750]	Loss: 0.1804	LR: 0.100000
Training Epoch: 4 [48128/48750]	Loss: 0.1819	LR: 0.100000
Training Epoch: 4 [48384/48750]	Loss: 0.1471	LR: 0.100000
Training Epoch: 4 [48640/48750]	Loss: 0.2050	LR: 0.100000
Training Epoch: 4 [48750/48750]	Loss: 0.0774	LR: 0.100000
Epoch 4 - Average Train Loss: 0.1437, Train Accuracy: 0.9510
Epoch 4 training time consumed: 351.63s
Evaluating Network.....
Test set: Epoch: 4, Average loss: 0.0005, Accuracy: 0.9615, Time consumed:23.47s
Training Epoch: 5 [256/48750]	Loss: 0.1278	LR: 0.100000
Training Epoch: 5 [512/48750]	Loss: 0.1793	LR: 0.100000
Training Epoch: 5 [768/48750]	Loss: 0.1792	LR: 0.100000
Training Epoch: 5 [1024/48750]	Loss: 0.2149	LR: 0.100000
Training Epoch: 5 [1280/48750]	Loss: 0.0771	LR: 0.100000
Training Epoch: 5 [1536/48750]	Loss: 0.1597	LR: 0.100000
Training Epoch: 5 [1792/48750]	Loss: 0.1892	LR: 0.100000
Training Epoch: 5 [2048/48750]	Loss: 0.1292	LR: 0.100000
Training Epoch: 5 [2304/48750]	Loss: 0.1212	LR: 0.100000
Training Epoch: 5 [2560/48750]	Loss: 0.1321	LR: 0.100000
Training Epoch: 5 [2816/48750]	Loss: 0.1046	LR: 0.100000
Training Epoch: 5 [3072/48750]	Loss: 0.1524	LR: 0.100000
Training Epoch: 5 [3328/48750]	Loss: 0.1085	LR: 0.100000
Training Epoch: 5 [3584/48750]	Loss: 0.1818	LR: 0.100000
Training Epoch: 5 [3840/48750]	Loss: 0.1428	LR: 0.100000
Training Epoch: 5 [4096/48750]	Loss: 0.1675	LR: 0.100000
Training Epoch: 5 [4352/48750]	Loss: 0.2593	LR: 0.100000
Training Epoch: 5 [4608/48750]	Loss: 0.1526	LR: 0.100000
Training Epoch: 5 [4864/48750]	Loss: 0.1655	LR: 0.100000
Training Epoch: 5 [5120/48750]	Loss: 0.2466	LR: 0.100000
Training Epoch: 5 [5376/48750]	Loss: 0.1405	LR: 0.100000
Training Epoch: 5 [5632/48750]	Loss: 0.2424	LR: 0.100000
Training Epoch: 5 [5888/48750]	Loss: 0.2091	LR: 0.100000
Training Epoch: 5 [6144/48750]	Loss: 0.1295	LR: 0.100000
Training Epoch: 5 [6400/48750]	Loss: 0.2319	LR: 0.100000
Training Epoch: 5 [6656/48750]	Loss: 0.2094	LR: 0.100000
Training Epoch: 5 [6912/48750]	Loss: 0.1290	LR: 0.100000
Training Epoch: 5 [7168/48750]	Loss: 0.1615	LR: 0.100000
Training Epoch: 5 [7424/48750]	Loss: 0.2516	LR: 0.100000
Training Epoch: 5 [7680/48750]	Loss: 0.1407	LR: 0.100000
Training Epoch: 5 [7936/48750]	Loss: 0.1892	LR: 0.100000
Training Epoch: 5 [8192/48750]	Loss: 0.1387	LR: 0.100000
Training Epoch: 5 [8448/48750]	Loss: 0.1319	LR: 0.100000
Training Epoch: 5 [8704/48750]	Loss: 0.1490	LR: 0.100000
Training Epoch: 5 [8960/48750]	Loss: 0.1342	LR: 0.100000
Training Epoch: 5 [9216/48750]	Loss: 0.1143	LR: 0.100000
Training Epoch: 5 [9472/48750]	Loss: 0.1289	LR: 0.100000
Training Epoch: 5 [9728/48750]	Loss: 0.0918	LR: 0.100000
Training Epoch: 5 [9984/48750]	Loss: 0.1060	LR: 0.100000
Training Epoch: 5 [10240/48750]	Loss: 0.1216	LR: 0.100000
Training Epoch: 5 [10496/48750]	Loss: 0.1432	LR: 0.100000
Training Epoch: 5 [10752/48750]	Loss: 0.1225	LR: 0.100000
Training Epoch: 5 [11008/48750]	Loss: 0.1272	LR: 0.100000
Training Epoch: 5 [11264/48750]	Loss: 0.2608	LR: 0.100000
Training Epoch: 5 [11520/48750]	Loss: 0.2016	LR: 0.100000
Training Epoch: 5 [11776/48750]	Loss: 0.0996	LR: 0.100000
Training Epoch: 5 [12032/48750]	Loss: 0.1853	LR: 0.100000
Training Epoch: 5 [12288/48750]	Loss: 0.1347	LR: 0.100000
Training Epoch: 5 [12544/48750]	Loss: 0.1673	LR: 0.100000
Training Epoch: 5 [12800/48750]	Loss: 0.1120	LR: 0.100000
Training Epoch: 5 [13056/48750]	Loss: 0.0749	LR: 0.100000
Training Epoch: 5 [13312/48750]	Loss: 0.1459	LR: 0.100000
Training Epoch: 5 [13568/48750]	Loss: 0.1629	LR: 0.100000
Training Epoch: 5 [13824/48750]	Loss: 0.1061	LR: 0.100000
Training Epoch: 5 [14080/48750]	Loss: 0.0966	LR: 0.100000
Training Epoch: 5 [14336/48750]	Loss: 0.2211	LR: 0.100000
Training Epoch: 5 [14592/48750]	Loss: 0.1827	LR: 0.100000
Training Epoch: 5 [14848/48750]	Loss: 0.1434	LR: 0.100000
Training Epoch: 5 [15104/48750]	Loss: 0.1144	LR: 0.100000
Training Epoch: 5 [15360/48750]	Loss: 0.1061	LR: 0.100000
Training Epoch: 5 [15616/48750]	Loss: 0.1422	LR: 0.100000
Training Epoch: 5 [15872/48750]	Loss: 0.1375	LR: 0.100000
Training Epoch: 5 [16128/48750]	Loss: 0.1741	LR: 0.100000
Training Epoch: 5 [16384/48750]	Loss: 0.1397	LR: 0.100000
Training Epoch: 5 [16640/48750]	Loss: 0.1045	LR: 0.100000
Training Epoch: 5 [16896/48750]	Loss: 0.0798	LR: 0.100000
Training Epoch: 5 [17152/48750]	Loss: 0.1600	LR: 0.100000
Training Epoch: 5 [17408/48750]	Loss: 0.1864	LR: 0.100000
Training Epoch: 5 [17664/48750]	Loss: 0.1460	LR: 0.100000
Training Epoch: 5 [17920/48750]	Loss: 0.1683	LR: 0.100000
Training Epoch: 5 [18176/48750]	Loss: 0.1653	LR: 0.100000
Training Epoch: 5 [18432/48750]	Loss: 0.1774	LR: 0.100000
Training Epoch: 5 [18688/48750]	Loss: 0.1284	LR: 0.100000
Training Epoch: 5 [18944/48750]	Loss: 0.2039	LR: 0.100000
Training Epoch: 5 [19200/48750]	Loss: 0.1050	LR: 0.100000
Training Epoch: 5 [19456/48750]	Loss: 0.1872	LR: 0.100000
Training Epoch: 5 [19712/48750]	Loss: 0.1339	LR: 0.100000
Training Epoch: 5 [19968/48750]	Loss: 0.2199	LR: 0.100000
Training Epoch: 5 [20224/48750]	Loss: 0.1672	LR: 0.100000
Training Epoch: 5 [20480/48750]	Loss: 0.1813	LR: 0.100000
Training Epoch: 5 [20736/48750]	Loss: 0.1056	LR: 0.100000
Training Epoch: 5 [20992/48750]	Loss: 0.1747	LR: 0.100000
Training Epoch: 5 [21248/48750]	Loss: 0.1288	LR: 0.100000
Training Epoch: 5 [21504/48750]	Loss: 0.2150	LR: 0.100000
Training Epoch: 5 [21760/48750]	Loss: 0.2149	LR: 0.100000
Training Epoch: 5 [22016/48750]	Loss: 0.0946	LR: 0.100000
Training Epoch: 5 [22272/48750]	Loss: 0.1024	LR: 0.100000
Training Epoch: 5 [22528/48750]	Loss: 0.2479	LR: 0.100000
Training Epoch: 5 [22784/48750]	Loss: 0.1625	LR: 0.100000
Training Epoch: 5 [23040/48750]	Loss: 0.1387	LR: 0.100000
Training Epoch: 5 [23296/48750]	Loss: 0.0650	LR: 0.100000
Training Epoch: 5 [23552/48750]	Loss: 0.1462	LR: 0.100000
Training Epoch: 5 [23808/48750]	Loss: 0.1305	LR: 0.100000
Training Epoch: 5 [24064/48750]	Loss: 0.1448	LR: 0.100000
Training Epoch: 5 [24320/48750]	Loss: 0.0992	LR: 0.100000
Training Epoch: 5 [24576/48750]	Loss: 0.1317	LR: 0.100000
Training Epoch: 5 [24832/48750]	Loss: 0.1747	LR: 0.100000
Training Epoch: 5 [25088/48750]	Loss: 0.1914	LR: 0.100000
Training Epoch: 5 [25344/48750]	Loss: 0.1161	LR: 0.100000
Training Epoch: 5 [25600/48750]	Loss: 0.1688	LR: 0.100000
Training Epoch: 5 [25856/48750]	Loss: 0.1618	LR: 0.100000
Training Epoch: 5 [26112/48750]	Loss: 0.0959	LR: 0.100000
Training Epoch: 5 [26368/48750]	Loss: 0.1256	LR: 0.100000
Training Epoch: 5 [26624/48750]	Loss: 0.1825	LR: 0.100000
Training Epoch: 5 [26880/48750]	Loss: 0.1173	LR: 0.100000
Training Epoch: 5 [27136/48750]	Loss: 0.1018	LR: 0.100000
Training Epoch: 5 [27392/48750]	Loss: 0.1700	LR: 0.100000
Training Epoch: 5 [27648/48750]	Loss: 0.1467	LR: 0.100000
Training Epoch: 5 [27904/48750]	Loss: 0.0654	LR: 0.100000
Training Epoch: 5 [28160/48750]	Loss: 0.1921	LR: 0.100000
Training Epoch: 5 [28416/48750]	Loss: 0.2348	LR: 0.100000
Training Epoch: 5 [28672/48750]	Loss: 0.2764	LR: 0.100000
Training Epoch: 5 [28928/48750]	Loss: 0.1163	LR: 0.100000
Training Epoch: 5 [29184/48750]	Loss: 0.1449	LR: 0.100000
Training Epoch: 5 [29440/48750]	Loss: 0.2717	LR: 0.100000
Training Epoch: 5 [29696/48750]	Loss: 0.2036	LR: 0.100000
Training Epoch: 5 [29952/48750]	Loss: 0.2259	LR: 0.100000
Training Epoch: 5 [30208/48750]	Loss: 0.1431	LR: 0.100000
Training Epoch: 5 [30464/48750]	Loss: 0.2256	LR: 0.100000
Training Epoch: 5 [30720/48750]	Loss: 0.1951	LR: 0.100000
Training Epoch: 5 [30976/48750]	Loss: 0.1415	LR: 0.100000
Training Epoch: 5 [31232/48750]	Loss: 0.1361	LR: 0.100000
Training Epoch: 5 [31488/48750]	Loss: 0.1697	LR: 0.100000
Training Epoch: 5 [31744/48750]	Loss: 0.1322	LR: 0.100000
Training Epoch: 5 [32000/48750]	Loss: 0.2440	LR: 0.100000
Training Epoch: 5 [32256/48750]	Loss: 0.1378	LR: 0.100000
Training Epoch: 5 [32512/48750]	Loss: 0.1265	LR: 0.100000
Training Epoch: 5 [32768/48750]	Loss: 0.1534	LR: 0.100000
Training Epoch: 5 [33024/48750]	Loss: 0.2535	LR: 0.100000
Training Epoch: 5 [33280/48750]	Loss: 0.1148	LR: 0.100000
Training Epoch: 5 [33536/48750]	Loss: 0.1846	LR: 0.100000
Training Epoch: 5 [33792/48750]	Loss: 0.1246	LR: 0.100000
Training Epoch: 5 [34048/48750]	Loss: 0.1739	LR: 0.100000
Training Epoch: 5 [34304/48750]	Loss: 0.1902	LR: 0.100000
Training Epoch: 5 [34560/48750]	Loss: 0.0922	LR: 0.100000
Training Epoch: 5 [34816/48750]	Loss: 0.0838	LR: 0.100000
Training Epoch: 5 [35072/48750]	Loss: 0.1319	LR: 0.100000
Training Epoch: 5 [35328/48750]	Loss: 0.1171	LR: 0.100000
Training Epoch: 5 [35584/48750]	Loss: 0.1047	LR: 0.100000
Training Epoch: 5 [35840/48750]	Loss: 0.1534	LR: 0.100000
Training Epoch: 5 [36096/48750]	Loss: 0.1306	LR: 0.100000
Training Epoch: 5 [36352/48750]	Loss: 0.1894	LR: 0.100000
Training Epoch: 5 [36608/48750]	Loss: 0.1822	LR: 0.100000
Training Epoch: 5 [36864/48750]	Loss: 0.1888	LR: 0.100000
Training Epoch: 5 [37120/48750]	Loss: 0.1668	LR: 0.100000
Training Epoch: 5 [37376/48750]	Loss: 0.1932	LR: 0.100000
Training Epoch: 5 [37632/48750]	Loss: 0.1830	LR: 0.100000
Training Epoch: 5 [37888/48750]	Loss: 0.1499	LR: 0.100000
Training Epoch: 5 [38144/48750]	Loss: 0.2246	LR: 0.100000
Training Epoch: 5 [38400/48750]	Loss: 0.2568	LR: 0.100000
Training Epoch: 5 [38656/48750]	Loss: 0.1926	LR: 0.100000
Training Epoch: 5 [38912/48750]	Loss: 0.1712	LR: 0.100000
Training Epoch: 5 [39168/48750]	Loss: 0.2070	LR: 0.100000
Training Epoch: 5 [39424/48750]	Loss: 0.1554	LR: 0.100000
Training Epoch: 5 [39680/48750]	Loss: 0.1614	LR: 0.100000
Training Epoch: 5 [39936/48750]	Loss: 0.2060	LR: 0.100000
Training Epoch: 5 [40192/48750]	Loss: 0.2380	LR: 0.100000
Training Epoch: 5 [40448/48750]	Loss: 0.2828	LR: 0.100000
Training Epoch: 5 [40704/48750]	Loss: 0.1858	LR: 0.100000
Training Epoch: 5 [40960/48750]	Loss: 0.2458	LR: 0.100000
Training Epoch: 5 [41216/48750]	Loss: 0.1992	LR: 0.100000
Training Epoch: 5 [41472/48750]	Loss: 0.1553	LR: 0.100000
Training Epoch: 5 [41728/48750]	Loss: 0.1922	LR: 0.100000
Training Epoch: 5 [41984/48750]	Loss: 0.2334	LR: 0.100000
Training Epoch: 5 [42240/48750]	Loss: 0.2598	LR: 0.100000
Training Epoch: 5 [42496/48750]	Loss: 0.2752	LR: 0.100000
Training Epoch: 5 [42752/48750]	Loss: 0.2398	LR: 0.100000
Training Epoch: 5 [43008/48750]	Loss: 0.1495	LR: 0.100000
Training Epoch: 5 [43264/48750]	Loss: 0.2192	LR: 0.100000
Training Epoch: 5 [43520/48750]	Loss: 0.1445	LR: 0.100000
Training Epoch: 5 [43776/48750]	Loss: 0.2365	LR: 0.100000
Training Epoch: 5 [44032/48750]	Loss: 0.1984	LR: 0.100000
Training Epoch: 5 [44288/48750]	Loss: 0.1462	LR: 0.100000
Training Epoch: 5 [44544/48750]	Loss: 0.2698	LR: 0.100000
Training Epoch: 5 [44800/48750]	Loss: 0.1649	LR: 0.100000
Training Epoch: 5 [45056/48750]	Loss: 0.1521	LR: 0.100000
Training Epoch: 5 [45312/48750]	Loss: 0.2034	LR: 0.100000
Training Epoch: 5 [45568/48750]	Loss: 0.1702	LR: 0.100000
Training Epoch: 5 [45824/48750]	Loss: 0.1952	LR: 0.100000
Training Epoch: 5 [46080/48750]	Loss: 0.2003	LR: 0.100000
Training Epoch: 5 [46336/48750]	Loss: 0.1469	LR: 0.100000
Training Epoch: 5 [46592/48750]	Loss: 0.0854	LR: 0.100000
Training Epoch: 5 [46848/48750]	Loss: 0.1563	LR: 0.100000
Training Epoch: 5 [47104/48750]	Loss: 0.1675	LR: 0.100000
Training Epoch: 5 [47360/48750]	Loss: 0.1291	LR: 0.100000
Training Epoch: 5 [47616/48750]	Loss: 0.1570	LR: 0.100000
Training Epoch: 5 [47872/48750]	Loss: 0.1479	LR: 0.100000
Training Epoch: 5 [48128/48750]	Loss: 0.1644	LR: 0.100000
Training Epoch: 5 [48384/48750]	Loss: 0.1398	LR: 0.100000
Training Epoch: 5 [48640/48750]	Loss: 0.1229	LR: 0.100000
Training Epoch: 5 [48750/48750]	Loss: 0.0878	LR: 0.100000
Epoch 5 - Average Train Loss: 0.1630, Train Accuracy: 0.9444
Epoch 5 training time consumed: 352.05s
Evaluating Network.....
Test set: Epoch: 5, Average loss: 0.0007, Accuracy: 0.9419, Time consumed:23.46s
Training Epoch: 6 [256/48750]	Loss: 0.1061	LR: 0.100000
Training Epoch: 6 [512/48750]	Loss: 0.1637	LR: 0.100000
Training Epoch: 6 [768/48750]	Loss: 0.0764	LR: 0.100000
Training Epoch: 6 [1024/48750]	Loss: 0.0622	LR: 0.100000
Training Epoch: 6 [1280/48750]	Loss: 0.2113	LR: 0.100000
Training Epoch: 6 [1536/48750]	Loss: 0.2074	LR: 0.100000
Training Epoch: 6 [1792/48750]	Loss: 0.1703	LR: 0.100000
Training Epoch: 6 [2048/48750]	Loss: 0.2044	LR: 0.100000
Training Epoch: 6 [2304/48750]	Loss: 0.1412	LR: 0.100000
Training Epoch: 6 [2560/48750]	Loss: 0.1433	LR: 0.100000
Training Epoch: 6 [2816/48750]	Loss: 0.1541	LR: 0.100000
Training Epoch: 6 [3072/48750]	Loss: 0.1777	LR: 0.100000
Training Epoch: 6 [3328/48750]	Loss: 0.1336	LR: 0.100000
Training Epoch: 6 [3584/48750]	Loss: 0.1590	LR: 0.100000
Training Epoch: 6 [3840/48750]	Loss: 0.1810	LR: 0.100000
Training Epoch: 6 [4096/48750]	Loss: 0.1995	LR: 0.100000
Training Epoch: 6 [4352/48750]	Loss: 0.1277	LR: 0.100000
Training Epoch: 6 [4608/48750]	Loss: 0.1805	LR: 0.100000
Training Epoch: 6 [4864/48750]	Loss: 0.1674	LR: 0.100000
Training Epoch: 6 [5120/48750]	Loss: 0.1550	LR: 0.100000
Training Epoch: 6 [5376/48750]	Loss: 0.1434	LR: 0.100000
Training Epoch: 6 [5632/48750]	Loss: 0.1457	LR: 0.100000
Training Epoch: 6 [5888/48750]	Loss: 0.0894	LR: 0.100000
Training Epoch: 6 [6144/48750]	Loss: 0.2151	LR: 0.100000
Training Epoch: 6 [6400/48750]	Loss: 0.1095	LR: 0.100000
Training Epoch: 6 [6656/48750]	Loss: 0.1765	LR: 0.100000
Training Epoch: 6 [6912/48750]	Loss: 0.1557	LR: 0.100000
Training Epoch: 6 [7168/48750]	Loss: 0.2143	LR: 0.100000
Training Epoch: 6 [7424/48750]	Loss: 0.0819	LR: 0.100000
Training Epoch: 6 [7680/48750]	Loss: 0.1294	LR: 0.100000
Training Epoch: 6 [7936/48750]	Loss: 0.1520	LR: 0.100000
Training Epoch: 6 [8192/48750]	Loss: 0.1630	LR: 0.100000
Training Epoch: 6 [8448/48750]	Loss: 0.1258	LR: 0.100000
Training Epoch: 6 [8704/48750]	Loss: 0.1419	LR: 0.100000
Training Epoch: 6 [8960/48750]	Loss: 0.1891	LR: 0.100000
Training Epoch: 6 [9216/48750]	Loss: 0.1518	LR: 0.100000
Training Epoch: 6 [9472/48750]	Loss: 0.1026	LR: 0.100000
Training Epoch: 6 [9728/48750]	Loss: 0.2697	LR: 0.100000
Training Epoch: 6 [9984/48750]	Loss: 0.1921	LR: 0.100000
Training Epoch: 6 [10240/48750]	Loss: 0.1421	LR: 0.100000
Training Epoch: 6 [10496/48750]	Loss: 0.1157	LR: 0.100000
Training Epoch: 6 [10752/48750]	Loss: 0.1884	LR: 0.100000
Training Epoch: 6 [11008/48750]	Loss: 0.1215	LR: 0.100000
Training Epoch: 6 [11264/48750]	Loss: 0.1276	LR: 0.100000
Training Epoch: 6 [11520/48750]	Loss: 0.1372	LR: 0.100000
Training Epoch: 6 [11776/48750]	Loss: 0.1643	LR: 0.100000
Training Epoch: 6 [12032/48750]	Loss: 0.1073	LR: 0.100000
Training Epoch: 6 [12288/48750]	Loss: 0.1522	LR: 0.100000
Training Epoch: 6 [12544/48750]	Loss: 0.1537	LR: 0.100000
Training Epoch: 6 [12800/48750]	Loss: 0.1803	LR: 0.100000
Training Epoch: 6 [13056/48750]	Loss: 0.1304	LR: 0.100000
Training Epoch: 6 [13312/48750]	Loss: 0.1788	LR: 0.100000
Training Epoch: 6 [13568/48750]	Loss: 0.1872	LR: 0.100000
Training Epoch: 6 [13824/48750]	Loss: 0.1113	LR: 0.100000
Training Epoch: 6 [14080/48750]	Loss: 0.1621	LR: 0.100000
Training Epoch: 6 [14336/48750]	Loss: 0.2546	LR: 0.100000
Training Epoch: 6 [14592/48750]	Loss: 0.2202	LR: 0.100000
Training Epoch: 6 [14848/48750]	Loss: 0.1748	LR: 0.100000
Training Epoch: 6 [15104/48750]	Loss: 0.1830	LR: 0.100000
Training Epoch: 6 [15360/48750]	Loss: 0.2491	LR: 0.100000
Training Epoch: 6 [15616/48750]	Loss: 0.1059	LR: 0.100000
Training Epoch: 6 [15872/48750]	Loss: 0.2482	LR: 0.100000
Training Epoch: 6 [16128/48750]	Loss: 0.1495	LR: 0.100000
Training Epoch: 6 [16384/48750]	Loss: 0.2294	LR: 0.100000
Training Epoch: 6 [16640/48750]	Loss: 0.2045	LR: 0.100000
Training Epoch: 6 [16896/48750]	Loss: 0.2066	LR: 0.100000
Training Epoch: 6 [17152/48750]	Loss: 0.1946	LR: 0.100000
Training Epoch: 6 [17408/48750]	Loss: 0.1306	LR: 0.100000
Training Epoch: 6 [17664/48750]	Loss: 0.1237	LR: 0.100000
Training Epoch: 6 [17920/48750]	Loss: 0.1619	LR: 0.100000
Training Epoch: 6 [18176/48750]	Loss: 0.1836	LR: 0.100000
Training Epoch: 6 [18432/48750]	Loss: 0.1384	LR: 0.100000
Training Epoch: 6 [18688/48750]	Loss: 0.1931	LR: 0.100000
Training Epoch: 6 [18944/48750]	Loss: 0.1195	LR: 0.100000
Training Epoch: 6 [19200/48750]	Loss: 0.1607	LR: 0.100000
Training Epoch: 6 [19456/48750]	Loss: 0.2221	LR: 0.100000
Training Epoch: 6 [19712/48750]	Loss: 0.1648	LR: 0.100000
Training Epoch: 6 [19968/48750]	Loss: 0.1322	LR: 0.100000
Training Epoch: 6 [20224/48750]	Loss: 0.1536	LR: 0.100000
Training Epoch: 6 [20480/48750]	Loss: 0.1832	LR: 0.100000
Training Epoch: 6 [20736/48750]	Loss: 0.1112	LR: 0.100000
Training Epoch: 6 [20992/48750]	Loss: 0.1250	LR: 0.100000
Training Epoch: 6 [21248/48750]	Loss: 0.1035	LR: 0.100000
Training Epoch: 6 [21504/48750]	Loss: 0.1455	LR: 0.100000
Training Epoch: 6 [21760/48750]	Loss: 0.1700	LR: 0.100000
Training Epoch: 6 [22016/48750]	Loss: 0.1044	LR: 0.100000
Training Epoch: 6 [22272/48750]	Loss: 0.1547	LR: 0.100000
Training Epoch: 6 [22528/48750]	Loss: 0.1419	LR: 0.100000
Training Epoch: 6 [22784/48750]	Loss: 0.1787	LR: 0.100000
Training Epoch: 6 [23040/48750]	Loss: 0.1841	LR: 0.100000
Training Epoch: 6 [23296/48750]	Loss: 0.1464	LR: 0.100000
Training Epoch: 6 [23552/48750]	Loss: 0.1135	LR: 0.100000
Training Epoch: 6 [23808/48750]	Loss: 0.2196	LR: 0.100000
Training Epoch: 6 [24064/48750]	Loss: 0.1039	LR: 0.100000
Training Epoch: 6 [24320/48750]	Loss: 0.1672	LR: 0.100000
Training Epoch: 6 [24576/48750]	Loss: 0.1285	LR: 0.100000
Training Epoch: 6 [24832/48750]	Loss: 0.1730	LR: 0.100000
Training Epoch: 6 [25088/48750]	Loss: 0.2389	LR: 0.100000
Training Epoch: 6 [25344/48750]	Loss: 0.1957	LR: 0.100000
Training Epoch: 6 [25600/48750]	Loss: 0.2900	LR: 0.100000
Training Epoch: 6 [25856/48750]	Loss: 0.1668	LR: 0.100000
Training Epoch: 6 [26112/48750]	Loss: 0.1669	LR: 0.100000
Training Epoch: 6 [26368/48750]	Loss: 0.2189	LR: 0.100000
Training Epoch: 6 [26624/48750]	Loss: 0.1857	LR: 0.100000
Training Epoch: 6 [26880/48750]	Loss: 0.1846	LR: 0.100000
Training Epoch: 6 [27136/48750]	Loss: 0.1839	LR: 0.100000
Training Epoch: 6 [27392/48750]	Loss: 0.2035	LR: 0.100000
Training Epoch: 6 [27648/48750]	Loss: 0.1443	LR: 0.100000
Training Epoch: 6 [27904/48750]	Loss: 0.1667	LR: 0.100000
Training Epoch: 6 [28160/48750]	Loss: 0.2280	LR: 0.100000
Training Epoch: 6 [28416/48750]	Loss: 0.1920	LR: 0.100000
Training Epoch: 6 [28672/48750]	Loss: 0.1675	LR: 0.100000
Training Epoch: 6 [28928/48750]	Loss: 0.2372	LR: 0.100000
Training Epoch: 6 [29184/48750]	Loss: 0.1999	LR: 0.100000
Training Epoch: 6 [29440/48750]	Loss: 0.2526	LR: 0.100000
Training Epoch: 6 [29696/48750]	Loss: 0.2305	LR: 0.100000
Training Epoch: 6 [29952/48750]	Loss: 0.1870	LR: 0.100000
Training Epoch: 6 [30208/48750]	Loss: 0.2089	LR: 0.100000
Training Epoch: 6 [30464/48750]	Loss: 0.1260	LR: 0.100000
Training Epoch: 6 [30720/48750]	Loss: 0.1518	LR: 0.100000
Training Epoch: 6 [30976/48750]	Loss: 0.2737	LR: 0.100000
Training Epoch: 6 [31232/48750]	Loss: 0.1980	LR: 0.100000
Training Epoch: 6 [31488/48750]	Loss: 0.1167	LR: 0.100000
Training Epoch: 6 [31744/48750]	Loss: 0.1538	LR: 0.100000
Training Epoch: 6 [32000/48750]	Loss: 0.2113	LR: 0.100000
Training Epoch: 6 [32256/48750]	Loss: 0.1266	LR: 0.100000
Training Epoch: 6 [32512/48750]	Loss: 0.1517	LR: 0.100000
Training Epoch: 6 [32768/48750]	Loss: 0.1783	LR: 0.100000
Training Epoch: 6 [33024/48750]	Loss: 0.0930	LR: 0.100000
Training Epoch: 6 [33280/48750]	Loss: 0.1203	LR: 0.100000
Training Epoch: 6 [33536/48750]	Loss: 0.2217	LR: 0.100000
Training Epoch: 6 [33792/48750]	Loss: 0.2333	LR: 0.100000
Training Epoch: 6 [34048/48750]	Loss: 0.2457	LR: 0.100000
Training Epoch: 6 [34304/48750]	Loss: 0.2192	LR: 0.100000
Training Epoch: 6 [34560/48750]	Loss: 0.1173	LR: 0.100000
Training Epoch: 6 [34816/48750]	Loss: 0.2527	LR: 0.100000
Training Epoch: 6 [35072/48750]	Loss: 0.2054	LR: 0.100000
Training Epoch: 6 [35328/48750]	Loss: 0.1440	LR: 0.100000
Training Epoch: 6 [35584/48750]	Loss: 0.1584	LR: 0.100000
Training Epoch: 6 [35840/48750]	Loss: 0.2930	LR: 0.100000
Training Epoch: 6 [36096/48750]	Loss: 0.2071	LR: 0.100000
Training Epoch: 6 [36352/48750]	Loss: 0.1356	LR: 0.100000
Training Epoch: 6 [36608/48750]	Loss: 0.2238	LR: 0.100000
Training Epoch: 6 [36864/48750]	Loss: 0.2312	LR: 0.100000
Training Epoch: 6 [37120/48750]	Loss: 0.1615	LR: 0.100000
Training Epoch: 6 [37376/48750]	Loss: 0.1617	LR: 0.100000
Training Epoch: 6 [37632/48750]	Loss: 0.1913	LR: 0.100000
Training Epoch: 6 [37888/48750]	Loss: 0.1378	LR: 0.100000
Training Epoch: 6 [38144/48750]	Loss: 0.2278	LR: 0.100000
Training Epoch: 6 [38400/48750]	Loss: 0.2382	LR: 0.100000
Training Epoch: 6 [38656/48750]	Loss: 0.1274	LR: 0.100000
Training Epoch: 6 [38912/48750]	Loss: 0.1872	LR: 0.100000
Training Epoch: 6 [39168/48750]	Loss: 0.1609	LR: 0.100000
Training Epoch: 6 [39424/48750]	Loss: 0.2068	LR: 0.100000
Training Epoch: 6 [39680/48750]	Loss: 0.1918	LR: 0.100000
Training Epoch: 6 [39936/48750]	Loss: 0.2566	LR: 0.100000
Training Epoch: 6 [40192/48750]	Loss: 0.2030	LR: 0.100000
Training Epoch: 6 [40448/48750]	Loss: 0.3227	LR: 0.100000
Training Epoch: 6 [40704/48750]	Loss: 0.2559	LR: 0.100000
Training Epoch: 6 [40960/48750]	Loss: 0.2477	LR: 0.100000
Training Epoch: 6 [41216/48750]	Loss: 0.3119	LR: 0.100000
Training Epoch: 6 [41472/48750]	Loss: 0.2129	LR: 0.100000
Training Epoch: 6 [41728/48750]	Loss: 0.2920	LR: 0.100000
Training Epoch: 6 [41984/48750]	Loss: 0.2147	LR: 0.100000
Training Epoch: 6 [42240/48750]	Loss: 0.2283	LR: 0.100000
Training Epoch: 6 [42496/48750]	Loss: 0.1962	LR: 0.100000
Training Epoch: 6 [42752/48750]	Loss: 0.1632	LR: 0.100000
Training Epoch: 6 [43008/48750]	Loss: 0.3378	LR: 0.100000
Training Epoch: 6 [43264/48750]	Loss: 0.2322	LR: 0.100000
Training Epoch: 6 [43520/48750]	Loss: 0.1997	LR: 0.100000
Training Epoch: 6 [43776/48750]	Loss: 0.2161	LR: 0.100000
Training Epoch: 6 [44032/48750]	Loss: 0.2216	LR: 0.100000
Training Epoch: 6 [44288/48750]	Loss: 0.2215	LR: 0.100000
Training Epoch: 6 [44544/48750]	Loss: 0.1700	LR: 0.100000
Training Epoch: 6 [44800/48750]	Loss: 0.2871	LR: 0.100000
Training Epoch: 6 [45056/48750]	Loss: 0.2555	LR: 0.100000
Training Epoch: 6 [45312/48750]	Loss: 0.2168	LR: 0.100000
Training Epoch: 6 [45568/48750]	Loss: 0.2770	LR: 0.100000
Training Epoch: 6 [45824/48750]	Loss: 0.2036	LR: 0.100000
Training Epoch: 6 [46080/48750]	Loss: 0.1640	LR: 0.100000
Training Epoch: 6 [46336/48750]	Loss: 0.3238	LR: 0.100000
Training Epoch: 6 [46592/48750]	Loss: 0.2095	LR: 0.100000
Training Epoch: 6 [46848/48750]	Loss: 0.2936	LR: 0.100000
Training Epoch: 6 [47104/48750]	Loss: 0.2214	LR: 0.100000
Training Epoch: 6 [47360/48750]	Loss: 0.1932	LR: 0.100000
Training Epoch: 6 [47616/48750]	Loss: 0.2930	LR: 0.100000
Training Epoch: 6 [47872/48750]	Loss: 0.2808	LR: 0.100000
Training Epoch: 6 [48128/48750]	Loss: 0.2733	LR: 0.100000
Training Epoch: 6 [48384/48750]	Loss: 0.2666	LR: 0.100000
Training Epoch: 6 [48640/48750]	Loss: 0.2233	LR: 0.100000
Training Epoch: 6 [48750/48750]	Loss: 0.1850	LR: 0.100000
Epoch 6 - Average Train Loss: 0.1837, Train Accuracy: 0.9362
Epoch 6 training time consumed: 352.23s
Evaluating Network.....
Test set: Epoch: 6, Average loss: 0.0009, Accuracy: 0.9283, Time consumed:23.45s
Training Epoch: 7 [256/48750]	Loss: 0.1753	LR: 0.020000
Training Epoch: 7 [512/48750]	Loss: 0.2624	LR: 0.020000
Training Epoch: 7 [768/48750]	Loss: 0.1462	LR: 0.020000
Training Epoch: 7 [1024/48750]	Loss: 0.1541	LR: 0.020000
Training Epoch: 7 [1280/48750]	Loss: 0.1372	LR: 0.020000
Training Epoch: 7 [1536/48750]	Loss: 0.1089	LR: 0.020000
Training Epoch: 7 [1792/48750]	Loss: 0.1054	LR: 0.020000
Training Epoch: 7 [2048/48750]	Loss: 0.1219	LR: 0.020000
Training Epoch: 7 [2304/48750]	Loss: 0.1958	LR: 0.020000
Training Epoch: 7 [2560/48750]	Loss: 0.1160	LR: 0.020000
Training Epoch: 7 [2816/48750]	Loss: 0.0999	LR: 0.020000
Training Epoch: 7 [3072/48750]	Loss: 0.1039	LR: 0.020000
Training Epoch: 7 [3328/48750]	Loss: 0.1247	LR: 0.020000
Training Epoch: 7 [3584/48750]	Loss: 0.0763	LR: 0.020000
Training Epoch: 7 [3840/48750]	Loss: 0.0764	LR: 0.020000
Training Epoch: 7 [4096/48750]	Loss: 0.1631	LR: 0.020000
Training Epoch: 7 [4352/48750]	Loss: 0.0631	LR: 0.020000
Training Epoch: 7 [4608/48750]	Loss: 0.1093	LR: 0.020000
Training Epoch: 7 [4864/48750]	Loss: 0.0925	LR: 0.020000
Training Epoch: 7 [5120/48750]	Loss: 0.1069	LR: 0.020000
Training Epoch: 7 [5376/48750]	Loss: 0.1132	LR: 0.020000
Training Epoch: 7 [5632/48750]	Loss: 0.1023	LR: 0.020000
Training Epoch: 7 [5888/48750]	Loss: 0.1261	LR: 0.020000
Training Epoch: 7 [6144/48750]	Loss: 0.0513	LR: 0.020000
Training Epoch: 7 [6400/48750]	Loss: 0.0712	LR: 0.020000
Training Epoch: 7 [6656/48750]	Loss: 0.0638	LR: 0.020000
Training Epoch: 7 [6912/48750]	Loss: 0.1088	LR: 0.020000
Training Epoch: 7 [7168/48750]	Loss: 0.0568	LR: 0.020000
Training Epoch: 7 [7424/48750]	Loss: 0.0684	LR: 0.020000
Training Epoch: 7 [7680/48750]	Loss: 0.0288	LR: 0.020000
Training Epoch: 7 [7936/48750]	Loss: 0.0587	LR: 0.020000
Training Epoch: 7 [8192/48750]	Loss: 0.0798	LR: 0.020000
Training Epoch: 7 [8448/48750]	Loss: 0.0558	LR: 0.020000
Training Epoch: 7 [8704/48750]	Loss: 0.0730	LR: 0.020000
Training Epoch: 7 [8960/48750]	Loss: 0.0609	LR: 0.020000
Training Epoch: 7 [9216/48750]	Loss: 0.0635	LR: 0.020000
Training Epoch: 7 [9472/48750]	Loss: 0.0349	LR: 0.020000
Training Epoch: 7 [9728/48750]	Loss: 0.0855	LR: 0.020000
Training Epoch: 7 [9984/48750]	Loss: 0.0756	LR: 0.020000
Training Epoch: 7 [10240/48750]	Loss: 0.0383	LR: 0.020000
Training Epoch: 7 [10496/48750]	Loss: 0.1285	LR: 0.020000
Training Epoch: 7 [10752/48750]	Loss: 0.0914	LR: 0.020000
Training Epoch: 7 [11008/48750]	Loss: 0.0973	LR: 0.020000
Training Epoch: 7 [11264/48750]	Loss: 0.1176	LR: 0.020000
Training Epoch: 7 [11520/48750]	Loss: 0.0685	LR: 0.020000
Training Epoch: 7 [11776/48750]	Loss: 0.0398	LR: 0.020000
Training Epoch: 7 [12032/48750]	Loss: 0.0515	LR: 0.020000
Training Epoch: 7 [12288/48750]	Loss: 0.0704	LR: 0.020000
Training Epoch: 7 [12544/48750]	Loss: 0.0910	LR: 0.020000
Training Epoch: 7 [12800/48750]	Loss: 0.0978	LR: 0.020000
Training Epoch: 7 [13056/48750]	Loss: 0.0994	LR: 0.020000
Training Epoch: 7 [13312/48750]	Loss: 0.0340	LR: 0.020000
Training Epoch: 7 [13568/48750]	Loss: 0.0445	LR: 0.020000
Training Epoch: 7 [13824/48750]	Loss: 0.0798	LR: 0.020000
Training Epoch: 7 [14080/48750]	Loss: 0.0601	LR: 0.020000
Training Epoch: 7 [14336/48750]	Loss: 0.0965	LR: 0.020000
Training Epoch: 7 [14592/48750]	Loss: 0.0790	LR: 0.020000
Training Epoch: 7 [14848/48750]	Loss: 0.0690	LR: 0.020000
Training Epoch: 7 [15104/48750]	Loss: 0.0386	LR: 0.020000
Training Epoch: 7 [15360/48750]	Loss: 0.0783	LR: 0.020000
Training Epoch: 7 [15616/48750]	Loss: 0.0842	LR: 0.020000
Training Epoch: 7 [15872/48750]	Loss: 0.0621	LR: 0.020000
Training Epoch: 7 [16128/48750]	Loss: 0.0337	LR: 0.020000
Training Epoch: 7 [16384/48750]	Loss: 0.0995	LR: 0.020000
Training Epoch: 7 [16640/48750]	Loss: 0.0231	LR: 0.020000
Training Epoch: 7 [16896/48750]	Loss: 0.0326	LR: 0.020000
Training Epoch: 7 [17152/48750]	Loss: 0.0392	LR: 0.020000
Training Epoch: 7 [17408/48750]	Loss: 0.0543	LR: 0.020000
Training Epoch: 7 [17664/48750]	Loss: 0.0569	LR: 0.020000
Training Epoch: 7 [17920/48750]	Loss: 0.0394	LR: 0.020000
Training Epoch: 7 [18176/48750]	Loss: 0.0661	LR: 0.020000
Training Epoch: 7 [18432/48750]	Loss: 0.0550	LR: 0.020000
Training Epoch: 7 [18688/48750]	Loss: 0.0371	LR: 0.020000
Training Epoch: 7 [18944/48750]	Loss: 0.0938	LR: 0.020000
Training Epoch: 7 [19200/48750]	Loss: 0.0426	LR: 0.020000
Training Epoch: 7 [19456/48750]	Loss: 0.0254	LR: 0.020000
Training Epoch: 7 [19712/48750]	Loss: 0.0635	LR: 0.020000
Training Epoch: 7 [19968/48750]	Loss: 0.0168	LR: 0.020000
Training Epoch: 7 [20224/48750]	Loss: 0.0387	LR: 0.020000
Training Epoch: 7 [20480/48750]	Loss: 0.0663	LR: 0.020000
Training Epoch: 7 [20736/48750]	Loss: 0.0983	LR: 0.020000
Training Epoch: 7 [20992/48750]	Loss: 0.1064	LR: 0.020000
Training Epoch: 7 [21248/48750]	Loss: 0.0510	LR: 0.020000
Training Epoch: 7 [21504/48750]	Loss: 0.0243	LR: 0.020000
Training Epoch: 7 [21760/48750]	Loss: 0.0581	LR: 0.020000
Training Epoch: 7 [22016/48750]	Loss: 0.0704	LR: 0.020000
Training Epoch: 7 [22272/48750]	Loss: 0.0540	LR: 0.020000
Training Epoch: 7 [22528/48750]	Loss: 0.0462	LR: 0.020000
Training Epoch: 7 [22784/48750]	Loss: 0.0691	LR: 0.020000
Training Epoch: 7 [23040/48750]	Loss: 0.0254	LR: 0.020000
Training Epoch: 7 [23296/48750]	Loss: 0.0837	LR: 0.020000
Training Epoch: 7 [23552/48750]	Loss: 0.0428	LR: 0.020000
Training Epoch: 7 [23808/48750]	Loss: 0.0557	LR: 0.020000
Training Epoch: 7 [24064/48750]	Loss: 0.0685	LR: 0.020000
Training Epoch: 7 [24320/48750]	Loss: 0.0759	LR: 0.020000
Training Epoch: 7 [24576/48750]	Loss: 0.1255	LR: 0.020000
Training Epoch: 7 [24832/48750]	Loss: 0.0295	LR: 0.020000
Training Epoch: 7 [25088/48750]	Loss: 0.0372	LR: 0.020000
Training Epoch: 7 [25344/48750]	Loss: 0.0270	LR: 0.020000
Training Epoch: 7 [25600/48750]	Loss: 0.0526	LR: 0.020000
Training Epoch: 7 [25856/48750]	Loss: 0.0577	LR: 0.020000
Training Epoch: 7 [26112/48750]	Loss: 0.0558	LR: 0.020000
Training Epoch: 7 [26368/48750]	Loss: 0.0717	LR: 0.020000
Training Epoch: 7 [26624/48750]	Loss: 0.0487	LR: 0.020000
Training Epoch: 7 [26880/48750]	Loss: 0.0886	LR: 0.020000
Training Epoch: 7 [27136/48750]	Loss: 0.0268	LR: 0.020000
Training Epoch: 7 [27392/48750]	Loss: 0.0438	LR: 0.020000
Training Epoch: 7 [27648/48750]	Loss: 0.0577	LR: 0.020000
Training Epoch: 7 [27904/48750]	Loss: 0.0622	LR: 0.020000
Training Epoch: 7 [28160/48750]	Loss: 0.0181	LR: 0.020000
Training Epoch: 7 [28416/48750]	Loss: 0.0666	LR: 0.020000
Training Epoch: 7 [28672/48750]	Loss: 0.0659	LR: 0.020000
Training Epoch: 7 [28928/48750]	Loss: 0.0424	LR: 0.020000
Training Epoch: 7 [29184/48750]	Loss: 0.0621	LR: 0.020000
Training Epoch: 7 [29440/48750]	Loss: 0.0799	LR: 0.020000
Training Epoch: 7 [29696/48750]	Loss: 0.0588	LR: 0.020000
Training Epoch: 7 [29952/48750]	Loss: 0.0545	LR: 0.020000
Training Epoch: 7 [30208/48750]	Loss: 0.0442	LR: 0.020000
Training Epoch: 7 [30464/48750]	Loss: 0.0650	LR: 0.020000
Training Epoch: 7 [30720/48750]	Loss: 0.0796	LR: 0.020000
Training Epoch: 7 [30976/48750]	Loss: 0.0958	LR: 0.020000
Training Epoch: 7 [31232/48750]	Loss: 0.0365	LR: 0.020000
Training Epoch: 7 [31488/48750]	Loss: 0.0508	LR: 0.020000
Training Epoch: 7 [31744/48750]	Loss: 0.0397	LR: 0.020000
Training Epoch: 7 [32000/48750]	Loss: 0.0259	LR: 0.020000
Training Epoch: 7 [32256/48750]	Loss: 0.0426	LR: 0.020000
Training Epoch: 7 [32512/48750]	Loss: 0.0501	LR: 0.020000
Training Epoch: 7 [32768/48750]	Loss: 0.0567	LR: 0.020000
Training Epoch: 7 [33024/48750]	Loss: 0.0359	LR: 0.020000
Training Epoch: 7 [33280/48750]	Loss: 0.0358	LR: 0.020000
Training Epoch: 7 [33536/48750]	Loss: 0.0367	LR: 0.020000
Training Epoch: 7 [33792/48750]	Loss: 0.0577	LR: 0.020000
Training Epoch: 7 [34048/48750]	Loss: 0.0448	LR: 0.020000
Training Epoch: 7 [34304/48750]	Loss: 0.0720	LR: 0.020000
Training Epoch: 7 [34560/48750]	Loss: 0.0406	LR: 0.020000
Training Epoch: 7 [34816/48750]	Loss: 0.0258	LR: 0.020000
Training Epoch: 7 [35072/48750]	Loss: 0.0318	LR: 0.020000
Training Epoch: 7 [35328/48750]	Loss: 0.0409	LR: 0.020000
Training Epoch: 7 [35584/48750]	Loss: 0.0717	LR: 0.020000
Training Epoch: 7 [35840/48750]	Loss: 0.0259	LR: 0.020000
Training Epoch: 7 [36096/48750]	Loss: 0.0498	LR: 0.020000
Training Epoch: 7 [36352/48750]	Loss: 0.0411	LR: 0.020000
Training Epoch: 7 [36608/48750]	Loss: 0.0462	LR: 0.020000
Training Epoch: 7 [36864/48750]	Loss: 0.0881	LR: 0.020000
Training Epoch: 7 [37120/48750]	Loss: 0.0386	LR: 0.020000
Training Epoch: 7 [37376/48750]	Loss: 0.0260	LR: 0.020000
Training Epoch: 7 [37632/48750]	Loss: 0.0515	LR: 0.020000
Training Epoch: 7 [37888/48750]	Loss: 0.0511	LR: 0.020000
Training Epoch: 7 [38144/48750]	Loss: 0.0514	LR: 0.020000
Training Epoch: 7 [38400/48750]	Loss: 0.0594	LR: 0.020000
Training Epoch: 7 [38656/48750]	Loss: 0.0510	LR: 0.020000
Training Epoch: 7 [38912/48750]	Loss: 0.0622	LR: 0.020000
Training Epoch: 7 [39168/48750]	Loss: 0.0297	LR: 0.020000
Training Epoch: 7 [39424/48750]	Loss: 0.0654	LR: 0.020000
Training Epoch: 7 [39680/48750]	Loss: 0.0604	LR: 0.020000
Training Epoch: 7 [39936/48750]	Loss: 0.0972	LR: 0.020000
Training Epoch: 7 [40192/48750]	Loss: 0.0297	LR: 0.020000
Training Epoch: 7 [40448/48750]	Loss: 0.0961	LR: 0.020000
Training Epoch: 7 [40704/48750]	Loss: 0.0272	LR: 0.020000
Training Epoch: 7 [40960/48750]	Loss: 0.0823	LR: 0.020000
Training Epoch: 7 [41216/48750]	Loss: 0.0586	LR: 0.020000
Training Epoch: 7 [41472/48750]	Loss: 0.0809	LR: 0.020000
Training Epoch: 7 [41728/48750]	Loss: 0.0661	LR: 0.020000
Training Epoch: 7 [41984/48750]	Loss: 0.0541	LR: 0.020000
Training Epoch: 7 [42240/48750]	Loss: 0.0608	LR: 0.020000
Training Epoch: 7 [42496/48750]	Loss: 0.0745	LR: 0.020000
Training Epoch: 7 [42752/48750]	Loss: 0.0398	LR: 0.020000
Training Epoch: 7 [43008/48750]	Loss: 0.0850	LR: 0.020000
Training Epoch: 7 [43264/48750]	Loss: 0.0785	LR: 0.020000
Training Epoch: 7 [43520/48750]	Loss: 0.0589	LR: 0.020000
Training Epoch: 7 [43776/48750]	Loss: 0.0868	LR: 0.020000
Training Epoch: 7 [44032/48750]	Loss: 0.0278	LR: 0.020000
Training Epoch: 7 [44288/48750]	Loss: 0.0443	LR: 0.020000
Training Epoch: 7 [44544/48750]	Loss: 0.0442	LR: 0.020000
Training Epoch: 7 [44800/48750]	Loss: 0.0568	LR: 0.020000
Training Epoch: 7 [45056/48750]	Loss: 0.0378	LR: 0.020000
Training Epoch: 7 [45312/48750]	Loss: 0.0318	LR: 0.020000
Training Epoch: 7 [45568/48750]	Loss: 0.0626	LR: 0.020000
Training Epoch: 7 [45824/48750]	Loss: 0.0521	LR: 0.020000
Training Epoch: 7 [46080/48750]	Loss: 0.0673	LR: 0.020000
Training Epoch: 7 [46336/48750]	Loss: 0.0752	LR: 0.020000
Training Epoch: 7 [46592/48750]	Loss: 0.0315	LR: 0.020000
Training Epoch: 7 [46848/48750]	Loss: 0.0800	LR: 0.020000
Training Epoch: 7 [47104/48750]	Loss: 0.1003	LR: 0.020000
Training Epoch: 7 [47360/48750]	Loss: 0.0752	LR: 0.020000
Training Epoch: 7 [47616/48750]	Loss: 0.0318	LR: 0.020000
Training Epoch: 7 [47872/48750]	Loss: 0.0538	LR: 0.020000
Training Epoch: 7 [48128/48750]	Loss: 0.0774	LR: 0.020000
Training Epoch: 7 [48384/48750]	Loss: 0.0583	LR: 0.020000
Training Epoch: 7 [48640/48750]	Loss: 0.0560	LR: 0.020000
Training Epoch: 7 [48750/48750]	Loss: 0.0396	LR: 0.020000
Epoch 7 - Average Train Loss: 0.0668, Train Accuracy: 0.9779
Epoch 7 training time consumed: 351.73s
Evaluating Network.....
Test set: Epoch: 7, Average loss: 0.0004, Accuracy: 0.9667, Time consumed:23.47s
Saving weights file to checkpoint/retrain/ViT/Sunday_20_July_2025_13h_32m_51s/ViT-Cifar10-seed10-ret75-7-best.pth
Training Epoch: 8 [256/48750]	Loss: 0.0484	LR: 0.020000
Training Epoch: 8 [512/48750]	Loss: 0.0440	LR: 0.020000
Training Epoch: 8 [768/48750]	Loss: 0.0405	LR: 0.020000
Training Epoch: 8 [1024/48750]	Loss: 0.0904	LR: 0.020000
Training Epoch: 8 [1280/48750]	Loss: 0.0194	LR: 0.020000
Training Epoch: 8 [1536/48750]	Loss: 0.0582	LR: 0.020000
Training Epoch: 8 [1792/48750]	Loss: 0.0253	LR: 0.020000
Training Epoch: 8 [2048/48750]	Loss: 0.0348	LR: 0.020000
Training Epoch: 8 [2304/48750]	Loss: 0.0760	LR: 0.020000
Training Epoch: 8 [2560/48750]	Loss: 0.0522	LR: 0.020000
Training Epoch: 8 [2816/48750]	Loss: 0.0603	LR: 0.020000
Training Epoch: 8 [3072/48750]	Loss: 0.0490	LR: 0.020000
Training Epoch: 8 [3328/48750]	Loss: 0.0445	LR: 0.020000
Training Epoch: 8 [3584/48750]	Loss: 0.0421	LR: 0.020000
Training Epoch: 8 [3840/48750]	Loss: 0.0418	LR: 0.020000
Training Epoch: 8 [4096/48750]	Loss: 0.0447	LR: 0.020000
Training Epoch: 8 [4352/48750]	Loss: 0.0356	LR: 0.020000
Training Epoch: 8 [4608/48750]	Loss: 0.0864	LR: 0.020000
Training Epoch: 8 [4864/48750]	Loss: 0.0397	LR: 0.020000
Training Epoch: 8 [5120/48750]	Loss: 0.0339	LR: 0.020000
Training Epoch: 8 [5376/48750]	Loss: 0.0626	LR: 0.020000
Training Epoch: 8 [5632/48750]	Loss: 0.0309	LR: 0.020000
Training Epoch: 8 [5888/48750]	Loss: 0.0732	LR: 0.020000
Training Epoch: 8 [6144/48750]	Loss: 0.0424	LR: 0.020000
Training Epoch: 8 [6400/48750]	Loss: 0.0397	LR: 0.020000
Training Epoch: 8 [6656/48750]	Loss: 0.0287	LR: 0.020000
Training Epoch: 8 [6912/48750]	Loss: 0.0347	LR: 0.020000
Training Epoch: 8 [7168/48750]	Loss: 0.0522	LR: 0.020000
Training Epoch: 8 [7424/48750]	Loss: 0.0428	LR: 0.020000
Training Epoch: 8 [7680/48750]	Loss: 0.0316	LR: 0.020000
Training Epoch: 8 [7936/48750]	Loss: 0.0426	LR: 0.020000
Training Epoch: 8 [8192/48750]	Loss: 0.0196	LR: 0.020000
Training Epoch: 8 [8448/48750]	Loss: 0.0414	LR: 0.020000
Training Epoch: 8 [8704/48750]	Loss: 0.0340	LR: 0.020000
Training Epoch: 8 [8960/48750]	Loss: 0.0603	LR: 0.020000
Training Epoch: 8 [9216/48750]	Loss: 0.0273	LR: 0.020000
Training Epoch: 8 [9472/48750]	Loss: 0.0311	LR: 0.020000
Training Epoch: 8 [9728/48750]	Loss: 0.0250	LR: 0.020000
Training Epoch: 8 [9984/48750]	Loss: 0.0272	LR: 0.020000
Training Epoch: 8 [10240/48750]	Loss: 0.0289	LR: 0.020000
Training Epoch: 8 [10496/48750]	Loss: 0.0743	LR: 0.020000
Training Epoch: 8 [10752/48750]	Loss: 0.0458	LR: 0.020000
Training Epoch: 8 [11008/48750]	Loss: 0.0289	LR: 0.020000
Training Epoch: 8 [11264/48750]	Loss: 0.0262	LR: 0.020000
Training Epoch: 8 [11520/48750]	Loss: 0.0434	LR: 0.020000
Training Epoch: 8 [11776/48750]	Loss: 0.0663	LR: 0.020000
Training Epoch: 8 [12032/48750]	Loss: 0.0427	LR: 0.020000
Training Epoch: 8 [12288/48750]	Loss: 0.0458	LR: 0.020000
Training Epoch: 8 [12544/48750]	Loss: 0.0265	LR: 0.020000
Training Epoch: 8 [12800/48750]	Loss: 0.0335	LR: 0.020000
Training Epoch: 8 [13056/48750]	Loss: 0.0510	LR: 0.020000
Training Epoch: 8 [13312/48750]	Loss: 0.0630	LR: 0.020000
Training Epoch: 8 [13568/48750]	Loss: 0.0128	LR: 0.020000
Training Epoch: 8 [13824/48750]	Loss: 0.0243	LR: 0.020000
Training Epoch: 8 [14080/48750]	Loss: 0.0594	LR: 0.020000
Training Epoch: 8 [14336/48750]	Loss: 0.0621	LR: 0.020000
Training Epoch: 8 [14592/48750]	Loss: 0.0333	LR: 0.020000
Training Epoch: 8 [14848/48750]	Loss: 0.0599	LR: 0.020000
Training Epoch: 8 [15104/48750]	Loss: 0.0309	LR: 0.020000
Training Epoch: 8 [15360/48750]	Loss: 0.0682	LR: 0.020000
Training Epoch: 8 [15616/48750]	Loss: 0.0340	LR: 0.020000
Training Epoch: 8 [15872/48750]	Loss: 0.0560	LR: 0.020000
Training Epoch: 8 [16128/48750]	Loss: 0.0374	LR: 0.020000
Training Epoch: 8 [16384/48750]	Loss: 0.0202	LR: 0.020000
Training Epoch: 8 [16640/48750]	Loss: 0.0272	LR: 0.020000
Training Epoch: 8 [16896/48750]	Loss: 0.0199	LR: 0.020000
Training Epoch: 8 [17152/48750]	Loss: 0.0421	LR: 0.020000
Training Epoch: 8 [17408/48750]	Loss: 0.0370	LR: 0.020000
Training Epoch: 8 [17664/48750]	Loss: 0.0356	LR: 0.020000
Training Epoch: 8 [17920/48750]	Loss: 0.0570	LR: 0.020000
Training Epoch: 8 [18176/48750]	Loss: 0.0445	LR: 0.020000
Training Epoch: 8 [18432/48750]	Loss: 0.0315	LR: 0.020000
Training Epoch: 8 [18688/48750]	Loss: 0.0404	LR: 0.020000
Training Epoch: 8 [18944/48750]	Loss: 0.0366	LR: 0.020000
Training Epoch: 8 [19200/48750]	Loss: 0.0318	LR: 0.020000
Training Epoch: 8 [19456/48750]	Loss: 0.0392	LR: 0.020000
Training Epoch: 8 [19712/48750]	Loss: 0.0296	LR: 0.020000
Training Epoch: 8 [19968/48750]	Loss: 0.0395	LR: 0.020000
Training Epoch: 8 [20224/48750]	Loss: 0.0564	LR: 0.020000
Training Epoch: 8 [20480/48750]	Loss: 0.0420	LR: 0.020000
Training Epoch: 8 [20736/48750]	Loss: 0.0821	LR: 0.020000
Training Epoch: 8 [20992/48750]	Loss: 0.0747	LR: 0.020000
Training Epoch: 8 [21248/48750]	Loss: 0.0279	LR: 0.020000
Training Epoch: 8 [21504/48750]	Loss: 0.0416	LR: 0.020000
Training Epoch: 8 [21760/48750]	Loss: 0.0426	LR: 0.020000
Training Epoch: 8 [22016/48750]	Loss: 0.0280	LR: 0.020000
Training Epoch: 8 [22272/48750]	Loss: 0.0914	LR: 0.020000
Training Epoch: 8 [22528/48750]	Loss: 0.0975	LR: 0.020000
Training Epoch: 8 [22784/48750]	Loss: 0.0597	LR: 0.020000
Training Epoch: 8 [23040/48750]	Loss: 0.0243	LR: 0.020000
Training Epoch: 8 [23296/48750]	Loss: 0.0223	LR: 0.020000
Training Epoch: 8 [23552/48750]	Loss: 0.0292	LR: 0.020000
Training Epoch: 8 [23808/48750]	Loss: 0.0167	LR: 0.020000
Training Epoch: 8 [24064/48750]	Loss: 0.0571	LR: 0.020000
Training Epoch: 8 [24320/48750]	Loss: 0.0292	LR: 0.020000
Training Epoch: 8 [24576/48750]	Loss: 0.0274	LR: 0.020000
Training Epoch: 8 [24832/48750]	Loss: 0.0311	LR: 0.020000
Training Epoch: 8 [25088/48750]	Loss: 0.0499	LR: 0.020000
Training Epoch: 8 [25344/48750]	Loss: 0.0497	LR: 0.020000
Training Epoch: 8 [25600/48750]	Loss: 0.0282	LR: 0.020000
Training Epoch: 8 [25856/48750]	Loss: 0.0299	LR: 0.020000
Training Epoch: 8 [26112/48750]	Loss: 0.0449	LR: 0.020000
Training Epoch: 8 [26368/48750]	Loss: 0.0338	LR: 0.020000
Training Epoch: 8 [26624/48750]	Loss: 0.0672	LR: 0.020000
Training Epoch: 8 [26880/48750]	Loss: 0.0292	LR: 0.020000
Training Epoch: 8 [27136/48750]	Loss: 0.0119	LR: 0.020000
Training Epoch: 8 [27392/48750]	Loss: 0.0458	LR: 0.020000
Training Epoch: 8 [27648/48750]	Loss: 0.0652	LR: 0.020000
Training Epoch: 8 [27904/48750]	Loss: 0.1036	LR: 0.020000
Training Epoch: 8 [28160/48750]	Loss: 0.0559	LR: 0.020000
Training Epoch: 8 [28416/48750]	Loss: 0.0493	LR: 0.020000
Training Epoch: 8 [28672/48750]	Loss: 0.0891	LR: 0.020000
Training Epoch: 8 [28928/48750]	Loss: 0.0153	LR: 0.020000
Training Epoch: 8 [29184/48750]	Loss: 0.0442	LR: 0.020000
Training Epoch: 8 [29440/48750]	Loss: 0.0449	LR: 0.020000
Training Epoch: 8 [29696/48750]	Loss: 0.0872	LR: 0.020000
Training Epoch: 8 [29952/48750]	Loss: 0.0454	LR: 0.020000
Training Epoch: 8 [30208/48750]	Loss: 0.0481	LR: 0.020000
Training Epoch: 8 [30464/48750]	Loss: 0.0314	LR: 0.020000
Training Epoch: 8 [30720/48750]	Loss: 0.0389	LR: 0.020000
Training Epoch: 8 [30976/48750]	Loss: 0.0385	LR: 0.020000
Training Epoch: 8 [31232/48750]	Loss: 0.0227	LR: 0.020000
Training Epoch: 8 [31488/48750]	Loss: 0.0296	LR: 0.020000
Training Epoch: 8 [31744/48750]	Loss: 0.0277	LR: 0.020000
Training Epoch: 8 [32000/48750]	Loss: 0.0757	LR: 0.020000
Training Epoch: 8 [32256/48750]	Loss: 0.0268	LR: 0.020000
Training Epoch: 8 [32512/48750]	Loss: 0.0611	LR: 0.020000
Training Epoch: 8 [32768/48750]	Loss: 0.0383	LR: 0.020000
Training Epoch: 8 [33024/48750]	Loss: 0.0492	LR: 0.020000
Training Epoch: 8 [33280/48750]	Loss: 0.0373	LR: 0.020000
Training Epoch: 8 [33536/48750]	Loss: 0.0328	LR: 0.020000
Training Epoch: 8 [33792/48750]	Loss: 0.0403	LR: 0.020000
Training Epoch: 8 [34048/48750]	Loss: 0.0294	LR: 0.020000
Training Epoch: 8 [34304/48750]	Loss: 0.0499	LR: 0.020000
Training Epoch: 8 [34560/48750]	Loss: 0.0816	LR: 0.020000
Training Epoch: 8 [34816/48750]	Loss: 0.0254	LR: 0.020000
Training Epoch: 8 [35072/48750]	Loss: 0.0465	LR: 0.020000
Training Epoch: 8 [35328/48750]	Loss: 0.1109	LR: 0.020000
Training Epoch: 8 [35584/48750]	Loss: 0.0263	LR: 0.020000
Training Epoch: 8 [35840/48750]	Loss: 0.0494	LR: 0.020000
Training Epoch: 8 [36096/48750]	Loss: 0.0203	LR: 0.020000
Training Epoch: 8 [36352/48750]	Loss: 0.0202	LR: 0.020000
Training Epoch: 8 [36608/48750]	Loss: 0.0267	LR: 0.020000
Training Epoch: 8 [36864/48750]	Loss: 0.0780	LR: 0.020000
Training Epoch: 8 [37120/48750]	Loss: 0.0341	LR: 0.020000
Training Epoch: 8 [37376/48750]	Loss: 0.0216	LR: 0.020000
Training Epoch: 8 [37632/48750]	Loss: 0.0239	LR: 0.020000
Training Epoch: 8 [37888/48750]	Loss: 0.0342	LR: 0.020000
Training Epoch: 8 [38144/48750]	Loss: 0.0449	LR: 0.020000
Training Epoch: 8 [38400/48750]	Loss: 0.0652	LR: 0.020000
Training Epoch: 8 [38656/48750]	Loss: 0.0265	LR: 0.020000
Training Epoch: 8 [38912/48750]	Loss: 0.0548	LR: 0.020000
Training Epoch: 8 [39168/48750]	Loss: 0.0204	LR: 0.020000
Training Epoch: 8 [39424/48750]	Loss: 0.0480	LR: 0.020000
Training Epoch: 8 [39680/48750]	Loss: 0.0441	LR: 0.020000
Training Epoch: 8 [39936/48750]	Loss: 0.0586	LR: 0.020000
Training Epoch: 8 [40192/48750]	Loss: 0.0266	LR: 0.020000
Training Epoch: 8 [40448/48750]	Loss: 0.1041	LR: 0.020000
Training Epoch: 8 [40704/48750]	Loss: 0.0284	LR: 0.020000
Training Epoch: 8 [40960/48750]	Loss: 0.0257	LR: 0.020000
Training Epoch: 8 [41216/48750]	Loss: 0.0471	LR: 0.020000
Training Epoch: 8 [41472/48750]	Loss: 0.0347	LR: 0.020000
Training Epoch: 8 [41728/48750]	Loss: 0.0311	LR: 0.020000
Training Epoch: 8 [41984/48750]	Loss: 0.0266	LR: 0.020000
Training Epoch: 8 [42240/48750]	Loss: 0.0210	LR: 0.020000
Training Epoch: 8 [42496/48750]	Loss: 0.0615	LR: 0.020000
Training Epoch: 8 [42752/48750]	Loss: 0.0425	LR: 0.020000
Training Epoch: 8 [43008/48750]	Loss: 0.0280	LR: 0.020000
Training Epoch: 8 [43264/48750]	Loss: 0.0279	LR: 0.020000
Training Epoch: 8 [43520/48750]	Loss: 0.0542	LR: 0.020000
Training Epoch: 8 [43776/48750]	Loss: 0.0489	LR: 0.020000
Training Epoch: 8 [44032/48750]	Loss: 0.0223	LR: 0.020000
Training Epoch: 8 [44288/48750]	Loss: 0.0453	LR: 0.020000
Training Epoch: 8 [44544/48750]	Loss: 0.0297	LR: 0.020000
Training Epoch: 8 [44800/48750]	Loss: 0.0174	LR: 0.020000
Training Epoch: 8 [45056/48750]	Loss: 0.0282	LR: 0.020000
Training Epoch: 8 [45312/48750]	Loss: 0.0637	LR: 0.020000
Training Epoch: 8 [45568/48750]	Loss: 0.0466	LR: 0.020000
Training Epoch: 8 [45824/48750]	Loss: 0.0335	LR: 0.020000
Training Epoch: 8 [46080/48750]	Loss: 0.0656	LR: 0.020000
Training Epoch: 8 [46336/48750]	Loss: 0.0603	LR: 0.020000
Training Epoch: 8 [46592/48750]	Loss: 0.0267	LR: 0.020000
Training Epoch: 8 [46848/48750]	Loss: 0.0349	LR: 0.020000
Training Epoch: 8 [47104/48750]	Loss: 0.0311	LR: 0.020000
Training Epoch: 8 [47360/48750]	Loss: 0.0224	LR: 0.020000
Training Epoch: 8 [47616/48750]	Loss: 0.0617	LR: 0.020000
Training Epoch: 8 [47872/48750]	Loss: 0.0448	LR: 0.020000
Training Epoch: 8 [48128/48750]	Loss: 0.0213	LR: 0.020000
Training Epoch: 8 [48384/48750]	Loss: 0.0243	LR: 0.020000
Training Epoch: 8 [48640/48750]	Loss: 0.0423	LR: 0.020000
Training Epoch: 8 [48750/48750]	Loss: 0.0039	LR: 0.020000
Epoch 8 - Average Train Loss: 0.0428, Train Accuracy: 0.9858
Epoch 8 training time consumed: 351.82s
Evaluating Network.....
Test set: Epoch: 8, Average loss: 0.0004, Accuracy: 0.9695, Time consumed:23.48s
Saving weights file to checkpoint/retrain/ViT/Sunday_20_July_2025_13h_32m_51s/ViT-Cifar10-seed10-ret75-8-best.pth
Valid (Test) Dl:  10000
Train Dl:  50000
Retain Train Dl:  48750
Forget Train Dl:  1250
Retain Valid Dl:  48750
Forget Valid Dl:  1250
retain_prob Distribution: 10000 samples
test_prob Distribution: 10000 samples
forget_prob Distribution: 1250 samples
Set1 Distribution: 1250 samples
Set2 Distribution: 1250 samples
Set1 Distribution: 1250 samples
Set2 Distribution: 1250 samples
Set1 Distribution: 10000 samples
Set2 Distribution: 10000 samples
Set1 Distribution: 10000 samples
Set2 Distribution: 10000 samples
Test Accuracy: 97.021484375
Retain Accuracy: 98.61817932128906
Zero-Retain Forget (ZRF): 0.7766534090042114
Membership Inference Attack (MIA): 0.8288
Forget vs Retain Membership Inference Attack (MIA): 0.496
Forget vs Test Membership Inference Attack (MIA): 0.548
Test vs Retain Membership Inference Attack (MIA): 0.503
Train vs Test Membership Inference Attack (MIA): 0.50675
Forget Set Accuracy (Df): 95.4950180053711
Method Execution Time: 5579.17 seconds
